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BIOSYNTHESIS OF POLYKETIDE SYNTHASE SUBSTRATES 

Technical Field 

[0001] The invention relates to methods to adapt microbial hosts for efficient 
production of polyketides. In one aspect, the hosts are modified to synthesize the starter 
5 and/or extender units used by polyketide synthases in the synthesis of polyketides. In 
another aspect, hosts are modified to synthesize synthases which accept substituted 
benzoates as starter units. Other host modifications may also be made. Thus, the invention 
includes methods for production of complex polyketides in such diverse organisms as 
Escherichia coli, Bacillus, Myxococcus, and Streptomyces. 

10 Background Art 
Rifamycin B 

[0002] The rifamycin synthetase is primed with a 3-amino-5-hydroxybenzoate (AHB) 
starter unit by a loading module that contains domains homologous to the adenylation (A) 
and thiolation (T) domains of nonribosomal peptide synthetases. The rifamycin synthetase 

15 of Amycolatopsis mediterranei is responsible for the biosynthesis of prosansamycin X, a 
precursor to the antibiotic rifamycin B (Figure 1). (The protein complex responsible for 
biosynthesis of prosansamycin X, a rifamycin B precursor, is referred to herein as 
rifamycin synthetase because the results described herein establish that ATP is required for 
covalent attachment of the aryl starter unit to the loading module of the complex.) The 

20 rifamycin synthetase consists of a core of five multifunctional proteins, RifA, RifB, RifC, 
RifD, and RifE, in addition to RifF, a protein that is believed to cyclize the linear product 
of the other proteins via intramolecular amide formation (Schupp, T., et al 9 FEMS 
Microbiol. Lett (1998) 159:201-207; August, P. R., et al, Chem. Biol (1998) 5:69-79; 
Tang, L., et al, Gene (1998) 216:255-265; Floss, H. G., et al, Curr. Opin. Chem. Biol 

25 (1 999) 3:592-597). The five multifunctional proteins can be further subdivided into one 
nonribosomal peptide synthetase (NRPS)-like loading module and ten polyketide synthase 
(PKS) modules, based on sequence homology to other systems. 

[0003] RifA, the N-terminal protein component of rifamycin synthetase, contains an 
NRPS-like module, the adenylation-thiolation (A-T) loading didomain, upstream of the 

30 first condensing module (Figure 1). The first such A-T type loading module was identified 
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in the gene cluster for the natural product rapamycin (Schwecke, T., et al 9 Proc. Natl 
Acad, Set USA (1995) 92:7839-7843). Complete gene clusters for other synthetases that 
contain hybrid modular interfaces have since been reported (Gehring, A. M., et al 9 Chem. 
Biol (1998) 5:573-586; Quadri, L. E. N., et al, Chem. Biol (1998) 5:631-645; 
5 Silakowski, B., et al 9 J. Biol Chem. (1999) 274:37391-37399; Julien, B., et al, Gene 

(2000) 249:153-160.; Tillett, D., et al. Chem. Biol (2000) 7:753-764; Wu, K., et al, Gene 
(2000) 251:81-90; Du, L., et al, Chem. Biol (2000) 7:623-640), and these synthetases 
produce hybrid natural products that are composed of both ketide and peptide units. The 
proven track record of polyketide and peptide natural products as therapeutics suggests that 

10 the increased combinatorial diversity embodied in hybrid products will advance drug 
discovery. It would be advantageous to use a biochemical understanding of hybrid 
synthetases coupled with the ability to manipulate hybrid interfaces through protein 
engineering to enable the potential of such hybrid molecules to be realized. 

[0004] The NRPS-like A-T didomain of RifA presumably primes the synthetase with 

1 5 3-amino-5-hydroxybenzoate (AHB), which has been shown to be the precursor of the 

mC?N structural element of rifamycin B (Figure 1) (Ghisalba, O., et al J. Antibiot. (1981) 
34:64-71; Anderson, M. G., etal t J. Chem. Soc. Chem. Commun. (1989) 311-313). 
However, the mechanism of this priming has not been established. Two alternative models 
can be envisioned. In the coenzyme A (CoA) ligase model prevalent in the literature 

20 (Schupp, T., et al, FEMS Microbiol. Lett. (1998) 159:201-207; August, P. R., et al, Chem. 
Biol (1998) 5:69-79; Ghisalba, O., et al y J. Antibiot. (1981) 34:64-71), the activated AHB- 
adenylate product of the A domain is attacked by CoA to generate an AHB-CoA 
intermediate, and the aryl thioester enzyme intermediate results from transthiolation onto 
the T domain (Figure 2A). In an alternative mechanism, which has been confirmed as 

25 detailed below, that is analogous to the mechanism used to prime NRPS modules, AHB is 
activated as the aryl-adenylate by the A domain, and the thiol of the phosphopantetheine 
cofactor of the T domain attacks AHB-adenylate directly to form a covalent aryl thioester 
enzyme intermediate (Figure 2B). 

[0005] Although AHB is the natural substrate of the A-T didomain, previous in vivo 

30 studies have revealed that RifA can be primed by the alternative substrates 3- 

hydroxybenzoate (3-HB) and 3,5-dihydroxybenzoate (Hunziker, D., et al, J. Am. Chem. 
Soc. (1998) 120:1092-1093). It would be advantageous to harness this innate substrate 
tolerance for its implications for the production of unnatural natural products. In one 
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aspect, it would be advantageous to reconstitute the activity of the A-T didomain of 
rifamycin synthetase in vitro in order to establish the mechanism of this priming module 
and to systematically investigate its substrate tolerance. Thus, the invention provides 
homologous substituted substrates for the production of unnatural natural products. 

5 6-Deoxyerythronolide B 

[0006] Erythromycin, a broad spectrum antibiotic synthesized by the bacterium 
Saccharopolyspora erythraea 9 is a prototype of a class of complex natural products called 
polyketides (OUagan, D., The Polyketide Metabolites (Ellis Horwood, Chichester, U.K., 
1991). Complex polyketides such as 6-deoxyerythronolide B (6-dEB), the macrocyclic 

10 core of the antibiotic erythromycin, constitute an important class of natural products. 
These biomolecules are synthesized from simple building blocks such as acetyl-CoA, 
propionyl-CoA, malonyl-CoA and methylmalonyl-CoA through the action of large modular 
megasynthases called polyketide synthases (Cane, D. E., et al, Science (1998) 282:63), 
generally found in actinomycetes. For example, the polyketide synthase (PKS) which 

15 results in the synthesis of 6-dEB is produced in Sacromyces erythraea. The polyketides 
produced in these native hosts are generally subsequently tailored to obtain the finished 
antibiotic by glycosylation, oxidation, hydroxylation and other modifying reactions. 
Polyketide structural complexity often precludes the development of practical laboratory 
synthetic routes, leaving fermentation as the only viable source for the commercial 

20 production of these pharmaceutically and agriculturally useful agents. At the same time, the 
challenges associated with developing scalable and economically feasible fermentation 
processes for polyketide production from natural biological sources (principally the 
Actinomyces family of bacteria) are enormous, and represent the most serious bottleneck 
during polyketide pre-clinical and clinical development. Recent work from this laboratory 

25 has demonstrated that it is possible to express polyketide synthase modules in a functional 
form in Escherichia coli (Gokhale, R. S., et al, Science (1999) 284:482-485). However, in 
order to harness these modular enzymes for polyketide biosynthesis in E. coli 9 or in other 
hosts that do not normally produce them it is also necessary to produce their appropriate 
substrates in vivo in a controlled manner. For example, metabolites such as acetyl-CoA, 

30 propionyl-CoA, malonyl-CoA and methylmalonyl-CoA are the most common substrates of 
these enzymes. E, coli has the capability to produce acetyl-CoA, propionyl-CoA, and 
malonyl-CoA; however, the latter two substrates are only present in small quantities in the 
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cell, and their biosynthesis is tightly controlled. The ability of £. coli to synthesize 
methylmalonyl-CoA has not been documented thus far. 

[0007] Similar conditions prevail in other microbial cells, especially those that do not 
natively produce polyketides, such as various species of Escherichia, Bacillus, Rhizobium, 
5 Pseudomonas, and Flavobacterium. Thus, generally, the required starter and/or extender 
units may not be produced in adequate amounts in any particular host. Further, by 
appropriate selection of the acyl transferase (AT) domains of the PKS in question, 
substrates more complex than those just mentioned may be employed. As an example, the 
PKS for synthesis of FK506 comprises an acyl transferase domain that incorporates 

10 substrates such as propyl malonyl-CoA in preference to malonyl-CoA or methylmalonyl- 
CoA. It would be helpful to have available a method which provides this range of 
substrates in appropriate levels in any arbitrarily chosen host organism. 

[0008] Additional problems that may need to be surmounted in effecting the production 
of polyketides in procaryotic hosts, especially those which do not natively produce 

15 polyketides, include the presence of enzymes which catabolize the required starter and/or 
extender units, such as the enzymes encoded by the prp operon of E. coli, which are 
responsible for catabolism of exogenous propionate as a carbon and energy source in this 
organism. In order to optimize production of a polyketide which utilizes propionyl CoA as 
a starter unit and/or utilizes its carboxylation product, methylmalonyl CoA as an extender 

20 unit, this operon should be disabled, except for that portion (the E locus) which encodes a 
propionyl CoA synthetase. Any additional loci which encode catabolizing enzymes for 
starter or extender units are also advantageously disabled. 

[0009] In addition, a particular procaryotic host, such as E. coli, may lack the 
phosphopantetheinyl transferase required for activation of the polyketide synthase. It may 

25 be required to modify the host to contain such a transferase as well. 

[0010] Recent studies in several Streptomyces spp. have demonstrated the beneficial 
impact of a thioesterase-like enzyme on polyketide productivity (Butler, A. R., et al, 
Chem. Biol (1999) 6(5);287-292; and Tang, L, et al, Chem. Biol (1999) 6(8):553-558). 
Homologs of these enzymes (termed thioesterase II or TEII) are encoded within the 

30 biosynthetic gene clusters of many macrolides. The mechanism of the Saccharopolyspora 
erythraea TEII is unclear, and it has been proposed that these enzymes play an editing role 
in polyketide biosynthesis (Butler, supra; and Heathcote, M. L., et al. Chemistry & 
Biology (2001) 8:207-220. 
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(0011] In summary, it would be advantageous to effect the production of polyketides in 
microbial, especially procaryotic, hosts in general, including hosts which do not natively 
produce polyketides. These latter hosts often have advantages over native polyketide 
producers such as Streptomyces in terms of ease of transformation, ability to grow rapidly 
5 in culture, and the like. These advantages are particularly useful in assessing the results of 
random mutagenesis or gene shuffling of polyketide synthases. Thus, the invention 
provides a multiplicity of approaches to adapt microbial hosts for the production of 
polyketides. 

Disclosure of the Invention 

10 [0012] In one aspect, the invention has achieved, for the first time, the production of a 
complete complex polyketide product, 6-dEB, in the ubiquitously useful host organism, 
E. colL The methods used to achieve this result are adaptable to microbial hosts in general, 
especially procaryotes. They can be used to adapt microbial hosts which do not natively 
produce polyketides to such production and to enhance the production of polyketides in 

15 hosts that normally produce them. Depending on the host chosen, the modifications 
required may include incorporation into the organism of expression systems for the 
polyketide synthase genes themselves; disabling of endogenous genes which encode 
catabolic enzymes for the starter and/or extender units; incorporation of expression systems 
for enzymes required for post translational modification of the synthases, such as 

20 phosphopantetheinyl transferase; and incorporation of enzymes which enhance the levels of 
starter and/or extender units. The particular combination of modifications required to adapt 
the host will vary with the nature of the polyketide desired and with the nature of the host 
itself. 

[0013] Thus, in one aspect, the invention is directed to microbial host cells which are 
25 genetically modified for enhanced synthesis of at least one polyketide wherein said 

modification comprises incorporation of at least one expression system for producing a 
protein that catalyzes the production of starter and/or extender units and/or disabling at 
least one endogenous pathway for catabolism of starter and/or extender units. In a 
particularly preferred embodiment, these modifications are made in the genome of the host 
30 organism, as this is advantageous for large-scale production of polyketides. 

[0014] In another aspect, the invention is directed to the production of a polyketide 
product comprising substituted benzoates used as starter unit substrates for an A-T loading 
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didomain of a rifamycin synthetase to make modified polyketides in organisms such as 
E. coli. In still another aspect, the invention includes a screening method to determine 
which substituted benzoate derivatives are viable substrates for an A-T didomain. 

[0015] Additional modifications may also be made, such as incorporating at least one 
5 expression system for a polyketide synthase protein and, if necessary, incorporating at least 
one expression system for a phosphopantetheinyl transferase. Additional expression 
systems can also be employed, if necessary, for further modification of synthesized 
polyketides such as hydroxylation, glycosylation, and the like. Also, an expression system 
for TEII may be employed. Again, it is often advantageous, although not necessary, to 
10 incorporate these expression systems into the genome of the host. 

[0016] In other aspects, the invention is directed to methods of preparing polyketides, 
including complete polyketides, in the modified cells of the invention. A preferred 
embodiment is a method to synthesize 6-dEB, 6-dEB analogs or other complete polyketides 
in E. colL 

15 [0017] In still another aspect, the invention is directed to a method to assess the results 
of gene shuffling or random mutagenesis of polyketide synthase genes by taking advantage 
of the high transformation efficiency of E. coli. An assay for polyketide production is also 
contemplated. 

Brief Description of the Drawings 

20 [0018] Figure 1 is a proposed biosynthetic scheme for prosansamycin X, a precursor to 
rifamycin B. The rifamycin synthetase consists of a core of five large multifunctional 
proteins, RifA, RifB, RifC, RifD, and RifE, each containing one or more PKS modules. 
Each PKS module catalyzes one cycle of .chain extension and associated fi-ketoreduction 
for the biosynthesis of prosansamycin X. The N-terminal A-T loading didomain of RifA 

25 primes the synthetase with AHB and is reminiscent of a minimal NRPS module. The 
location of the mC?N unit derived from AHB is shown in bold in the prosansamycin X 
structure. The active sites denote adenylation (A), thiolation (T), acyltransferase (AT), 
ketosynthase (KS), B-ketoreductase (KR), or dehydratase (DH) domains. As indicated, 
RifF is believed to catalyze cyclization via intramolecular amide formation. 

30 [0019] Figure 2 illustrates possible mechanisms for the A-T loading didomain. (A) In 
the CoA ligase model, the activated AHB-adenylate product of the A domain is attacked by 
CoA to generate an AHB-CoA intermediate, and the aryl thioester enzyme intermediate 
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results from transthiolation onto the T domain. (B) In the NRPS-like mechanism, AHB is 
activated as the AHB-adenylate by the A domain, and the thiol of the phosphopantetheine 
cofactor of the T domain attacks AHB-adenylate directly to form a covalent aryl thioester 
enzyme intermediate. 

5 [0020] Figure 3 is a chart showing the presence or absence of apo or holo A-T 
didomain, ATP, and [ 14 C]-B or [ 14 C]-3-HB based on the results of the ATP-dependent 
covalent loading of the holo A-T didomain with B and 3-HB is shown based on 
Coomassie-stained gel (4-15% gradient) of the reaction mixtures and an autoradiograph of 
this gel (not shown). 

10 [0021] Figure 4 graphs the high performance liquid chromatography (HPLC) traces of 
time courses of reactions containing the apo A-T didomain. No net formation of benzoyl- 
CoA is observed. Labeled peaks were identified by co-injection with authentic standards of 
CoA, B, and benzoyl-CoA. The HPLC traces were shifted progressively by 0.15 min. 
[0022] Figure 5 graphs the saturation curves for covalent loading of the holo A-T 

1 5 didomain by 3-HB (□) or B (O). Figure 5 A is a linear representation of the data. 

Figure 5B is a logarithmic representation of the data to facilitate evaluation of both data 
sets simultaneously. The lines are best fits of the data to a simple saturation model and 
give *cat = 1 .9 min" 1 and K M = 1 80 for 3-HB, and ^ at = 0.14 min 1 and Km = 170 |iM 
forB. 

20 [0023] Figure 6 shows the two plasmids used a synthetic operon approach to facilitate 
the expression of the DEBS and pec genes. The restriction sites are abbreviated as follows: 
X, Xbal; N, Ndel; E, EcoRI; H, Hindm; B, Bpul 1021; Ns, Nsil; Ps, PstI; P, Pad; D, 
Dralll. 

[0024] Figure 7 A is a schematic of the 6-deoxyerythronolide B synthase (DEBS). The 
25 catalytic domains are: KS, ketosynthase; AT, acyl transferase; ACP, acyl carrier protein; 
KR, ketoreductase; ER, enoyl reductase; DH, dehydratase, TE, thioesterase. DEBS utilizes 
1 mole of propionyl-CoA and 6 moles of (2S)-methylmalonyl-CoA to synthesize 1 mole of 
6-deoxyerythronolide B (6dEB, compound !)• Figure 7B illustrates that truncated 
DEBS1+TE produces the triketide lactone (compound 2). Figure 7C illustrates that the 
30 rifamycin synthetase is a polyketide synthase that is naturally primed by a nonribosomal 
peptide synthetase loading module, comprised of two domains- an ATP dependent 
adenylation domain (A) and a thiolation domain (T). Substitution of this A-T didomain in 
place of the loading didomain of DEBS yields an engineered "hybrid" synthase that utilizes 
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exogenous acids such as benzoic acid to synthesize substituted macrocycles such as 
compound 3 in an engineered strain of E. coli. 

[0025] Figure 8 is a schematic of the genetic design of E. coli BAPL 

[0026] Figure 9 shows the production of 6dEB in E. coli. Cellular protein content and 
5 6dEB concentration are plotted versus time. 

[0027] Figures 10A and 10B show E. coli Fed Batch Fermentation Experiments. 
Figure 10A shows: No additional propionate added over time. Figure 10B shows: With 
additional propionate feeding as specified in the Materials and Methods section. 

[0028] Figure 1 1 shows E. coli 6dEB Fed-Batch Fermentation including TEH gene 
10 expression. 

Modes of Carrying Out the Invention 

[0029] With regard to one illustrative aspect of the invention, in the illustrative 
example below, E. coli is modified to effect the production of 6-dEB, the polyketide 
precursor of erythromycin. The three proteins required for this synthesis, DEBS1, DEBS2 

15 and DEBS3 are known and the genes encoding them have been cloned and sequenced. A 
multiplicity of additional PKS genes have been cloned and sequenced as well, including 
those encoding enzymes which produce the polyketide precursors of avermectin, 
oleandomycin, epothilone, megalomycin, picromycin, FK506, FK520, rapamycin, tylosin, 
spinosad, and many others. In addition, methods to modify native PKS genes so as to alter 

20 the nature of the polyketide produced have been described. Production of hybrid modular 
PKS proteins and synthesis systems is described and claimed in U.S. patent 5,962,290. 
Methods to modify PKS enzymes so as to permit efficient incorporation of diketides is 
described in U.S. patent 6,080,555. Methods to modify PKS enzymes by mixing and 
matching individual domains or groups of domains is described in U.S. Serial 

25 No. 09/073,538. Methods to alter the specificity of modules of modular PKS's to 

incorporate particular starter or extender units are described in U.S. Serial No. 09/346,860, 
now allowed. Improved methods to prepare diketides for incorporation into polyketides is 
described in U.S. Serial No. 09/492,733. Methods to mediate the synthesis of the 
polyketide chain between modules are described in U.S. Serial No. 09/500,747. The 

30 contents of the foregoing patents and patent applications are incorporated herein by 
reference. 
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[0030] Thus, a selected host may be modified to include any one of many possible 
polyketide synthases by incorporating therein appropriate expression systems for the 
proteins included in such synthases. Either complete synthases or partial synthases may be 
supplied depending on the product desired. If the host produces polyketide synthase 
5 natively, and a different polyketide from that ordinarily produced is desired, it may be 
desirable to delete the genes encoding the native PKS. Methods for such deletion are 
described in U.S. patent 5,830,750, which is incorporated herein by reference. 

[0031] For hosts which do not natively produce polyketides, the enzymes that tailor 
polyketide synthases may be lacking or deficient, so that in addition to supplying the 

1 0 expression systems for the polyketide synthases themselves, it may be necessary, 

depending on the host chosen, to supply an expression system for these enzymes. One 
enzyme which is essential for the activity of PKS is a phosphopantetheinyl transferase. 
The genes encoding these transferases have been cloned and are available. These are 
described in U.S. patent application 08/728,742, which is now published, for example, in 

1 5 Canadian application 2,232,230. The contents of these documents are incorporated herein 
by reference. 

[0032] A particular selected host may natively include genes which produce proteins 
that catabolize desired starter and/or extender units. If such a host is selected, it may be 
advantageous to disable the production of proteins which catalyze such catabolism, as an 

20 important aspect of the invention is to modify the host so that the levels of required starter 
and/or extender units are enhanced. One non-limiting example of a catabolic system 
includes the prp operon wherein the proteins encoded by subunits A-D catabolize 
exogenous propionate. The enzyme encoded by prp E is desirable however as it is a 
propionyl CoA synthetase. The portions of the operon encoding catabolizing enzymes are 

25 advantageously disabled in modifying E. colu Similar operons in other hosts may be 
disabled as needed. 

[0033] An assay can be used to determine polyketide production in a cell that is unable 
to carry out propionate catabolism or anabolism by adding labeled propionate and 
separating it from polyketide that has been produced. 
30 [0034] In one aspect of the invention, enzymes that enhance or enable the production of 
starter and/or extender units, and any enzymes required for activation of these production 
enzymes are incorporated into the cells by modifying them to contain expression systems 
for these proteins. As stated above, these expression systems may be supplied as 
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extrachromosomal replicating vectors, but are preferably integrated into the host genome. 
When integrated into the genome, the necessity for selection pressure is mitigated and a 
stable production organism is obtained. 

[0035] In one illustrative embodiment of this aspect, advantage is taken of the mat ABC 
5 operon, which was recently cloned from Rhizobium trifoli (An, J.H., et al f Eur. J. Biochem. 
(1988) 15:395-402). This operon provides enzymes which permit the production of both 
extender and starter units commonly used in the synthesis of polyketides. Although it 
results normally in the production of malonyl-CoA and acetyl-CoA, the specificity of the 
enzymes encoded is not strict and thus, for example, propionyl-CoA and methylmalonyl- 
10 Co A can also be produced. This particular operon is used only for illustration, and genes 
encoding comparable enzymes can be found in a multiplicity of other organisms. Those of 
skill in the art are able to access such enzymes through homology searches in publicly 
available databases. Thus, this aspect of the invention is not to be regarded as limited to 
the mat ABC operon from Rhizobium trifoli only, but to extend to comparable systems 
15 available from other organisms. 

[0036] There are three proteins encoded by this operon. 

[0037] Mat A encodes a malonyl-CoA decarboxylase, which normally catalyzes the 
reaction: malonyl-CoA -» acetyl-CoA +C0 2 . 

[0038] MatB encodes a malonyl-CoA synthetase which catalyzes the reaction: malonic 
20 acid +CoASH -> malonyl-CoA (in an ATP dependent reaction). 

[0039] MatC encodes a malonate transporter which is believed to be responsible for 
transport of malonic acid across the cell membrane. 

[0040] These enzymes are demonstrated herein to be somewhat promiscuous with 
respect to substrate in their ability to catalyze the reactions shown. Thus, in addition to 
25 malonyl-CoA and malonic acids (for MatA and MatB respectively) as substrates, these 
enzymes can also utilize methylmalonyl-CoA and methylmalonic acid; ethylmalonyl-CoA 
and ethylmalonic acid; propylmalonyl-CoA and propylmalonic acid and the like. Thus, 
these enzymes can be used to provide a variety of starter and extender units for synthesis of 
desired polyketides. 

30 [0041] As stated above, homologs of this operon are also contemplated. For example, 
homologs oimatB and matC derived from S. coelicolor (GenBank accession 
No. AL1 63003) can be used. 
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[0042] Also useful in supplying an extender unit is the gene encoding propionyl CoA 
carboxylase. In one example of this activity, the carboxylase enzyme is a dimer encoded 
by the pccB and accA2 genes which have been characterized from Streptomyces coelicolor 
A3 by Rodriguez, E:, et al, Microbiology (1999) 145:3109-31 19. Methods of making 
5 2S-methylmalonyl CoA using homologs of accA2 and pccB genes is also contemplated. A 
biotin ligase encoded natively in this case by the accAl gene is needed for activation of 
these proteins. Alternative biotin ligases could also be used. The typical substrate for this 
enzyme is propionyl-CoA which is converted to methylmalonyl-CoA; a reaction which is 
summarized as 

1 0 propionyl-CoA+C0 2 ^methylmalonyl-Co A (an ATP dependent reaction). 

[0043] Other acyl-CoA substrates may also be converted to the corresponding 
malonyl-CoA products by this enzyme and its homologs. Again, homologs of these genes 
can readily be found by those of skill in the art using standard searching techniques on 
publicly available databases. The biotin ligase, in particular, need not necessarily be 

1 5 derived from the same genetic complement as the carboxylase itself. 

[0044] In addition to providing modified host cells that are efficient in producing 
polyketides, the polyketide synthases, their activation enzymes, and enzymes which 
provide starter and/or extender units can be used in in vitro systems to produce the desired 
polyketides. For example, the enzymes malonyl-CoA decarboxylase and/or malonyl-CoA 

20 synthetase such as those encoded by the mat ABC operon and/or propionyl-CoA 

carboxylase such as that encoded by the pccB and accA2 genes can be used in in vitro 
cultures to convert precursors to suitable extender and starter units for a desired PKS to 
effect synthesis of a polyketide in a cell-free or in in vitro cell culture system. Purified 
MatB is particularly advantageously used for the preparative cell free production of 

25 polyketides, since CoA thioesters are the most expensive components in such cell-free 
synthesis systems. Alternatively, as set forth above, these genes are used (in any suitable 
combination) in a general strategy for production by cells in culture of these substrates. 
MatB and MatC can be used to effect production of any alpha-carboxylated CoA thioester 
where the corresponding free acid can be recognized as a substrate by MatB. The MatA 

30 protein may also be used to supplement in vitro or in vivo levels of starter units such as 

acetyl-CoA and propionyl-CoA. The genes encoding propionyl-CoA carboxylase can also 
be used to provide the enzyme to synthesize suitable extender units in vivo. 
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(00451 The host may have a repertoire of other enzymes, such as the biotin ligase birh, 
which provides additional support. 

(0046] In general, the invention contemplates providing microbes that contain extender 
and/or starter units useful in polyketide synthases where the host either makes insufficient 
5 quantities of these materials for efficient synthesis or does not make detectable quantities ol 
them at all. For example, E. coli does not produce detectable amounts of S-methylmalonyl 
CoA, a required intermediate in 6dEB biosynthesis. Thus, the invention is directed 
specifically in one aspect to E. coli which are modified to contain S-methylmalonyl CoA. 
Similarly, other organisms are modified to contain additional polyketide intermediates such 

10 as ethylmalonyl CoA, methoxymalonyl CoA or hydroxymalonyl CoA of the appropriate 
stereochemistry for incorporation into polyketides. These hosts are distinguished from 
their unmodified forms by either the presence of, or by increased levels of, these polyketide 
synthase substrates and can be verified to contain these modifications by acquiring the 
ability to synthesize the relevant polyketides or to synthesize them at an enhanced level. 

1 5 [00471 In one general aspect, the invention is directed to modified host organisms 
which contain enhanced levels of PKS substrates or which contain them per se in the 
instance wherein the unmodified host does not contain detectable levels. 

[0048] The invention also includes a method to enhance or enable the production of a 
polyketide, including a complete complex polyketide in a microbial host, which method 

20 comprises culturing a modified host which has been provided with at least one expression 
system for an enzyme which enhances or enables the production of starter and/or extender 
units used in constructing the polyketide. A "complete" polyketide is a polyketide which 
forms the basis for an antibiotic, such as the polyketides which are precursors to 
erythromycin, megalomycin, and the like. The enzymes which are provided include but are 

25 not limited to those encoded by the mat ABC operon and their homologs in other organisms 
as well as the pccB and accA2 genes (and accAl) encoding propionyl carboxylase and their 
homologs in other organisms. In another aspect, the invention is directed to a method of 
enhancing or enabling production of polyketides in cell-free systems by providing one or 
more of these enzymes to the cell-free system; 

30 [0049] The invention is also directed to cells modified to produce the enzymes and to 
methods of producing polyketides using these cells, as well as to methods of producing 
polyketides using cell-free systems. 



- 12- 



WO 02/068613 



PCT/US02/06399 



(0050] The invention also includes a method to enhance or enable polyketide 

production in a microbial system by supplementing the medium with a substrate for an 

endogenous enzyme which converts this substrate to a starter or extender unit. 

[0051 J The invention also includes a method to produce polyketides in microbial hosts 

5 containing modifications to assist polyketide production, such as disarming of the 

endogenous genes which encode proteins for catabolism of required substrates, by 

supplying these cells with synthetic precursors, such as diketide precursors. 

[0052J The polyketide produced may be one normally produced by the PKS and may 

exist in nature; in this case the presence of the gene encoding the starter/extender 

10 production-enhancing enzyme in vivo or of the enzyme itself in cell free systems may 

simply enhance the level of production. In addition, the PKS may be a modified PKS 

designed to produce a novel polyketide, whose production may be enhanced in similar 

fashion. Because of the ability of the enzymes described herein to accept a wide range of 

substrates, extender units and starter units can be provided based on a wide range of readily 

1 5 available reagents. As stated above, diketide starting materials may also be supplied. 

[0053] The invention thus also includes the various other modifications of microbial 

hosts described above to permit or enhance their production of polyketides and to methods 

of producing polyketides using such hosts. 

[0054] The ability to modify hosts such as E. coli and other procaryotes such as 

20 Bacillus to permit production of polyketides in such hosts has numerous advantages, many 

of which reside in the inherent nature of E. coli. One important advantage resides in the 

ease with which E. coli can be transformed as compared to other microorganisms which 

natively produce polyketides. One important application of this transformation ease is in 

assessing the results of gene shuffling of polyketide synthases. Thus, an additional aspect 

25 of the invention is directed to a method to assess the results of polyketide synthase gene 

shuffling which method comprises transfecting a culture of the E. coli modified according 

to the invention with a mixture of shuffled polyketide synthases and culturing individual 

colonies. Those colonies which produce polyketides contain successfully shuffled genes. 

[0055] In addition to modifying microbial hosts, especially procaryotic hosts, to 

30 produce polyketides, these hosts may further be modified to produce the enzymes which 

"tailor" the polyketides and effect their conversion to antibiotics. Such tailoring reactions 

include glycosylation, oxidation, hydroxylation and the like. Organisms, preferably E. coli, 

which are modified to contain one or more polyketide modification enzymes, such as those 
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relating to p450, sugar biosynthesis and transfer, and methyl transferase are also 
contemplated. 

[0056] With regard to the nature of the hosts, £. coli is used as an important illustration. 
However, a variety of procaryotic hosts may also be useful. For example, in addition to 
5 E. coli, Bacillus, Salmonella, Rhizobium, Pseudomonas, and generally gram negative 
bacteria are useful. Other procaryotes may be used, and even eucaryotic microbes such as 
yeast. The advantages of the use of Saccharomyces cerevisiae, for example, as a 
production organism are well known. In addition, organisms such as Streptomyces which 
. natively produce polyketides may be used as hosts since the production of polyketides in 

10 these organisms may be enhanced by the methods of the invention. This is particularly the 
case where the host, which natively produces a polyketide, has been modified so that 
alternative extender units or alternative starter units are incorporated. In this case, even 
though the host natively makes polyketides, it may not have detectable levels of the starter 
and/or extender units now required for the modified PKS. 

1 5 [00571 As will be apparent to the skilled artisan, the precise nature of the manner of 

enhancing the levels of, or providing useable amounts of starter and/or extender units not 
natively produced will depend on the nature of the starter and/or extender units required. 
Typically, most complex polyketides require at least malonyl CoA or methylmalonyl CoA 
of the appropriate stereochemistry. Additional types of substituted malonyl CoA 

20 derivatives may be required in particular instances. The appropriate choice of expression 
system to provide the enzymes for production of such substrates or the inactivation of 
endogenous enzymes which may catabolize desired substrates is dependent on the nature of 
the polyketide to be synthesized and the nature of the host. Indeed, although typically the 
requisite expression systems for synthesis enzymes are provided from heterologous 

25 sources, in some instances, the native host may itself contain genes encoding the 

appropriate enzymes wherein these genes are not expressed. One example of this is the 
"sleeping beauty" operon in E. colL 

[0058] To effect production of the polyketides in a microbial host, it is preferable to 
permit substantial growth of the culture prior to inducing the enzymes which effect the 

30 synthesis of the polyketides. Thus, in hosts which do not natively produce polyketides, the 

required expression systems for the PKS genes are placed under control of an inducible 

promoter, such as the T7 promoter which is induced by isopropyl-p-D- 

thiogalactopyranoside (IPTG). There is a plethora of suitable promoters which are 
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inducible in a variety of such microbial hosts. The skilled artisan will be aware of 
promoters which are inducible under conditions that are compatible with polyketide 
production and which are appropriate for a particular host suggested. Other advantageous 
features of the modified host, such as the ability to synthesize starters or extenders, may 
5 also be under inducible control. Finally, precursors to the starting materials for polyketide 
synthase may be withheld until synthesis is desired. Thus, for example, if the starting 
materials are derived from propionate, propionate can be supplied at any desired point 
during the culturing of the cells. If a diketide or triketide starting material is used, this too 
can be withheld until the appropriate time. Prior to addition of the precursor, a minimal 
10 medium may be used and alternate carbon sources employed to supply, energy and 
materials for growth. 

[0059] The advantages of controlling the timing of production of substances such as 
polyketides are well known. Typically, the growth of the energy and metabolite 
requirements for the synthesis of such secondary metabolites impairs the ability of the 

15 organism to proliferate rapidly. Accordingly, it is advantageous to allow the organism to 
undergo exponential growth prior to inducing the production of a secondary metabolite. 

[0060] The success of applying the methods of the invention to provide useable or 
enhanced levels of substrate for polyketide synthase can conveniently be measured by the 
levels or rate of polyketide produced. Thus, the methods of the invention will enhance the 

20 level of production of polyketides by at least 5%, more preferably 10%, more 

preferably 25% and more preferably at least 50% as compared to production when the host 
is not modified under the same conditions. 

[0061] As described above, the invention provides methods for both in vitro and in vivo 
synthesis of any arbitrarily chosen polyketide where the in vivo synthesis may be conducted 

25 in any microbial, especially procaryotic host. The procaryotic host is typically of the genus 
Bacillus, PseudomonaSy Flavobacterium, or more typically Escherichia, in particular 
E. coli. Whether in vitro or in vivo synthesis is employed, it may be necessary to supply 
one or more of a suitable polyketide synthase (which may be native or modified), one or 
more enzymes to produce starter and/or extender units, typically including converting the 

30 free acid to the CoA derivative, and, if the foregoing enzymes are produced in a host, 

tailoring enzymes to activate them. In addition, for in vivo synthesis, it may be necessary 
to disarm catabolic enzymes which would otherwise destroy the appropriate starting 
materials. 
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[0062] With respect to production of starting materials, the genes of the mat ABC 
operon and the genes encoding propionyl carboxylase can be employed to produce their 
encoded proteins for use in cell free polyketide synthesis and also to modify recombinant 
hosts for production of polyketides in cell culture. These genes and their corresponding 
5 encoded products are useful to provide optimum levels of substrates for polyketide 

synthase in any host in which such synthesis is to be effected. The host may be one which 
natively produces a polyketide and its corresponding antibiotic or may be a recombinantly 
modified host which either does not natively produce any polyketide or which has been 
modified to produce a polyketide which it normally does not make. Thus, microorganism 

10 hosts which are useable for the synthesis of polyketides include various strains of 

Streptomyces, in particular S. coelicolor and S. lividans, various strains of Myxococcus, 
industrially favorable hosts such as E. coli, Bacillus, Pseudomonas or Flavobacterium, and 
other microorganisms such as yeast. These genes and their corresponding proteins are 
useful in adjusting substrate levels for polyketide synthesis generally. 

1 5 [0063] The methods of the invention can be used to enhance the production of a 

specific polyketide by a microbe. If the microbe does not natively produce the polyketide, 
but contains the relevant PKS genes only by virtue of recombinant manipulation, without 
the methods of the invention to provide starter and/or extender units, the production of 
polyketide may not proceed at all. Thus, the enhancement is essentially infinite (only 

20 because the base is 0). However, where detectable levels of polyketides are produced, 
either by virtue of prior modifications or because the microbe is natively able to produce 
them, the methods of the invention will enhance the levels of production by at least 5%, 
preferably 25%, more preferably 50%, and more preferably 100%, 200%, 5 fold, 10 fold,. 
100 fold or 200 fold. An extremely wide variation in the levels of polyketides produced is 

25 available depending on the starting point and the modifications made. 

Substrate Specificity and Polyketide Design 

[0064] These genes and their products are particularly useful because of the ability of 
the enzymes to utilize a range of starting materials. Thus, in general, propionyl 
carboxylase converts a thioester of the formula R 2 -CH-CO-SCoA, where each R is H or an 
30 optionally substituted alkyl or other optionally substituted hydrocarbyl group to the 
corresponding malonic acid thioester of the formula R2C(COOH)COSCoA. Other 
thioesters besides the natural co-enzyme A thioester may also be used such as the N-acyl 
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cysteamine thioesters. Similarly, the product of the matB gene can convert malonic acid 
derivatives of the formula R2C(COOH)2 to the corresponding acyl thioester, where each R 
is independently H or optionally substituted hydrocarbyl. A preferred starting material is 
that wherein R is alkyl (1-4C), preferably RCH(COOH) 2 . For in vivo systems, it may be 
5 advantageous to include the matC gene to ensure membrane transport of the starting 

malonic acid related material. The mat A gene encodes a protein which converts malonyl- 
CoA substrates of the formula R 2 C(COOH)COSCoA to the corresponding acyl-CoA of the 
formula R 2 CHCOSCoA, where R is defined as above, for use as a starter unit. 

[0065] Typically, the hydrocarbyl groups referred to above are alkyl groups of 1-8C, 

10 preferably 1-6C, and more preferably 1-4C. The alkyl groups may be straight chain or 
branch chain, but are preferably straight chain. The hydrocarbyl groups may also include 
unsaturation and may further contain substituents such as halo, hydroxyl, methoxyl or 
amino or methyl or dimethyl amino. Thus, the hydrocarbyl groups may be of the formula 
CH 3 CHCHCH 2 ; CH 2 CHCH 2 ; CH 3 OCH 2 CH 2 CH 2 ; CH 3 CCCH 2 ; CH 3 CH 2 CH 2 CH 2 CH 2 ; and 

15 the like. 

[0066] Also prepared by the invention methods and materials are extender units of the 
formula 

ROCH(COOH)COSCoA 
wherein R is defined as set forth above. 

20 [0067] The substituted alkyl groups are also 1-8C in the backbone chain, 

preferably 1-6C and more preferably 1-4C. The alkenyl and alkynyl hydrocarbyl groups 
contain 2-8C, preferably 2-6C, and more preferably 2-4C and may also be branched or 
straight chain, preferably straight chain. 

[0068] Further variability can be obtained by supplying as a starting material a suitable 

25 diketide. The diketide generally of the formulas such as those set forth in U.S. Serial 
No. 09/31 1,756 filed 14 May 1999 and incorporated herein by reference. A variety of 
substituents can then be introduced. Thus, the diketide will be of the general formula 
R'CH 2 CHOHCR 2 COSNAc wherein R is defined as above, and R' can be alkyl, 1-8C, aryl, 
aryl alkyl, and the like. SNAc represents a thioester of N-acetyl cysteamine, but alternative 

30 thioesters could also be used. 

[0069] For either in vivo or in vitro production of the polyketides, acyl transferase 

domains with desired specificities can be incorporated into the relevant PKS. Methods for 

assuring appropriate specificity of the AT domains is described in detail in U.S. Patent 
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Application 09/346,860 filed 2 July 1999, the contents of which are incorporated herein by 
reference, to describe how such domains of desired specificity can be created and 
employed. Also relevant to the use of these enzymes in vitro or the genes in vivo are 
methods to mediate polyketide synthase module effectiveness by assuring appropriate 
5 transfer of the growing polyketide chain from one module to the next. Such methods are 
described in detail in U.S. Serial No. 09/500,747 filed 9 February 2000, the contents of 
which are incorporated herein by reference for this description. 

[0070] As a preliminary matter in determining which substituted benzoates can serve as 
starter units, adenylation and thiolation activities of the loading module were reconstituted 

10 in vitro and shown to be independent of coenzyme A, countering literature proposals that 
the loading module is a coenzyme A ligase as shown in Example 7. Kinetic parameters for 
covalent arylation of the loading module were measured directly for the unnatural 
substrates benzoate (B) and 3-hydroxybenzoate (3-HB) as described in Example 8. This 
analysis was extended through competition experiments to determine the relative rates of 

1 5 incorporation of a series of substituted benzoates as described in Examples 9 and 10. The 
results in the examples show that the loading module can accept a variety of substituted 
benzoates, although it exhibits a preference for the 3-, 5-, and 3,5-disubstituted benzoates 
that most closely resemble its biological substrate. The remarkable substrate tolerance of 
the loading module of rifarnycin synthetase suggests that the module is useful as a tool for 

20 generating substituted derivatives of natural products. 

[0071] Substituted benzoates are defined as benzoate molecules that include any 
substituent or substituents. Benzoate substrates is a subset of substituted benzoates that 
primes an A-T didomain of a rifarnycin synthase or otherwise can be incorporated as a 
starter unit into a loading module or as an extender unit into a module of a synthase or a 

25 synthetase. Preferably the benzoate substrates include 3-, 5-, and 3,5-disubstituted 

benzoates. More preferably, the benzoates are selected from the group consisting of 2- 
aminobenzoate, 3-aminobenzoate, 4-aminobenzoate, 3-amino-5-hydroxybenzoate, 3- 
amino-4-hydroxybenzoate, 4-amino-2-hydroxybenzoate, 3-bromobenzoate, 3- 
chlorobenzoate, 3,5-diaminobenzoate, 3,5-dibromobenzoate, 3,5-dichlorobenzoate, 3,5- 

30 difluorobenzoate, 2,3-dihydroxybenzoate, 3,5-dihydroxybenzoate, 3,5-dinitrobenzoate, 3- 
fluorobenzoate, 2-hydroxybenzoate, 3-hydroxybenzoate, 4-hydroxybenzoate, 3- 
methoxybenzoate, 3-nitrobenzoate, and 3-sulfobenzoate. 
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{0072] The observation that CoA is not required for arylation of the T domain and that 
benzoyl-CoA is not a competent intermediate in this process establishes the loading module 
of rifamycin synthetase as an NRPS-like A-T didomain (Figure 2B). 

[00731 The conclusion that the loading module of rifamycin synthetase functions as an 
5 NRPS-like A-T didomain has implications for other systems. Biosynthetic gene clusters 
for rapamycin (Lowden, P. A. S., et al,Anges. Chem. Int. Ed. Engl (1996) 35:2249-2251), 
FK506 (Motamedi, H., et al, Eur. J. Biochem. (1998) 256:528-534), ansatrienin (Chen, S., 
et al, Eur. J. Biochem. (1999) 261 :98-107, FK520 (Wu, K., et al, Gene (2000) 251 :81-90), 
microcystin (Tillett, D., et al, Chem. Biol (2000) 7:753-764), and pimaricin 

10 (Aparicio, J. F., et al, Chem. Biol (2000) 7:895-905) all encode loading modules with 

homology to the A-T didomain of rifamycin synthetase. However, several of these systems 
have been proposed to be primed by an activated CoA substrate, presumably generated via 
a CoA ligase mechanism analogous to that shown in Figure 2A. (Schwecke, T., et al Proc. 
Natl Acad. Sci. USA (1995) 92:7839-7843; Motamedi, H., et al, Eur. J. Biochem. (1998) 

15 256:528-534; Moore, R. E., et al, J. Am. Chem. Soc, (1991) 1 13:5083-5084.) A more 
likely mechanism for priming of these systems is the adenylation-thiolation mechanism 
operative for rifamycin synthetase. 

[0074] Although the mechanisms shown in Figure 2 are distinct, the chemistries 
involved are essentially the same. In both cases activation of AHB occurs via the aryl- 

20 adenylate, and the only difference is whether or not there is intermediate transfer of AHB to 
CoA prior to arylation of the T domain. Because the phosphopantetheine cofactor of the 
T domain is derived from CoA, the thiol nucleophiles of the T domain and CoA are 
chemically equivalent. Therefore, it is not difficult to envision how an enzyme could 
evolve from a CoA ligase into an A-T didomain, simply by covalent incorporation of the 

25 nucleophilic end of CoA as a phosphopantetheine cofactor. There is presumably an 
advantage to covalently tethering the aryl substrate moiety to the synthetase via the 
T domain instead of noncovalently binding it as the aryl-CoA. Nevertheless, aryl-CoA 
ligases are known to be involved in polyketide synthesis in the plant kingdom (see, for 
example, Beerhues, L., FEBSLett. (1996) 383:264-266; Barillas, W., et al, Biol Chem. 

30 (2000) 381:155-160), and benzoyl-CoA appears to be a substrate of the iterative type II 

PKS that produces enterocin (Hertweck, C, et al, Tetrahedron (2000) 56:91 15-9120). 

[0075) Prior to this investigation, AHB, 3-HB, and 3,5-dihydroxybenzoate were known 

to be substrates of the A-T didomain (Hunziker, D., et al., J. Am. Chem. Soc. (1998) 
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120:1092-1093). Eleven additional substrates, including benzoate (B), have been identified 
herein (Table 1). Previous work suggests that the substrate tolerance of the A-T didomain 
of rifamycin synthetase for alternative substituted benzoates is shared to a degree by related 
bacterial benzoyl-CoA ligases (Geissler, J. F., et ai, J. Bad (1988) 170:1709-1714; 
5 Altenschmidt, U., J. Bact. (1991) 173:5494-5501); and EntE (Rusnak, R., et aL 9 

Biochemistry (1989) 28:6827-6835), a stand-alone A domain that is a component of the 
enterobactin synthetase. These proteins are able to accept several alternative substituted 
benzoates, in addition to their biological substrates. 

[0076] Although analysis of the substrate specificity results for the A-T didomain at a 

10 detailed molecular level awaits a crystal structure of this loading module, some preliminary 
observations can be made based on the substrate screening results and the relative reactivity 
data in Table 1 . With the exception of 2-aminobenzoate and B, only benzoates with 3-, 5-, 
or both 3- and 5-substituents are substrates for the A-T didomain. Binding sites that 
accommodate the 3-amino- and 5-hydroxy- substituents of the biological substrate AHB 

15 can apparently also accommodate alternative substituents at these positions. 

3-Sulfobenzoate, 3-nitrobenzoate, and 3,5-dinitrobenzoate were likely rejected as substrates 
for steric reasons (Figure 7), since both sulfo- and nitro- substituents are significantly larger 
than the amino- and hydroxy- substituents of AHB. In this regard, it is surprising that 
3-methoxybenzoate is accepted as a substrate, albeit a poor one, since the methoxy- 

20 substituent is also significantly larger than either substituent of AHB. The 3-fluoro- and 
3,5-difluorobenzoates are discriminated against by factors of 5 and 30 with respect to their 
chlorinated and brominated counterparts (Table 1). Changes in the electronic properties of 
the aromatic ring upon fluorination may account for these differences. Phenylacetate and 
3-hydroxyphenylacetate do not appear to be utilized as substrates by the A-T didomain, 

25 despite the reactivity of the corresponding benzoates, B and 3-HB (Table 1). This result 
suggests that the register of the carboxylate is a determinant of its reactivity, as the 
carboxylate of the phenylacetates is displaced by one methylene group relative to the 
benzoates. It should be noted that substituted benzoates were targeted as putative 
substrates in this study; the possibility that the tolerance of the A-T didomain for 

30 substituted benzoates extends to other types of aromatic substrates (e.g., heterocycles) 

remains to be tested. 

[0077] Th e remarkable substrate tolerance of the loading module of rifamycin 

synthetase for substituted benzoates has implications for the production of unnatural natural 
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products through protein engineering. The endogenous loading module of 
6-deoxyerythronolide B PKS was recently replaced by the loading module of the 
avermectin PKS, and the resulting hybrid synthase produced erythromycin derivatives that 
had incorporated branched starter units characteristic of the avermectin family 
5 (Mareden, A. F., et al , Science (1 998) 279;1 99-202). Similarly, exploiting the priming 
promiscuity of the A-T didomain of rifamycin synthetase by appending it to other 
synthases or synthetases, with the goal of generating substituted derivatives of the original 
products is contemplated according to the invention. 

[0078] Finally, this initial characterization of the loading module of rifamycin 
10 synthetase as an NRPS-like A-T didomain sets the stage for investigation of the hybrid 
NRPS/PKS interface in this system. Biochemical studies that combine the NRPS-like 
loading module and PKS module 1 of rifamycin synthetase (in cis or in trans) should allow 
functional and structural questions regarding NRPS/PKS biosynthetic interfaces to be 
addressed. 

15 [0079] The nucleotide sequences encoding a multiplicity of PKS permits their use in 
recombinant procedures for producing a desired PKS and for production of the proteins 
useful in postmacfolide conversions, as well as modified forms thereof. For example, the 
nucleotide sequences for genes related to the production of erythromycin is disclosed in 
U.S. 6,004,787 and U.S. 5,998,194; for avermectin in U.S. 5,252,474; for FK506 in 

20 U.S. 5,622,866; for rifamycin in W098/7868; for spiramycin in U.S. 5,098,837. These are 
merely examples. Portions of, or all of, the desired coding sequences can be synthesized 
using standard solid phase synthesis methods such as those described by Jaye, et ai, 
J. Biol Chem. (1984) 259:6331 and which are available commercially from, for example, 
Applied Biosystems, Inc. 

25 [0080] A portion of the PKS which encodes a particular activity can be isolated and 
manipulated, for example, by using it to replace the corresponding region in a different 
modular PKS. In addition, individual modules of the PKS may be ligated into suitable 
expression systems and used to produce the portion of the protein encoded by the open 
reading frame and the protein may then be isolated and purified, or which may be 

30 employed in situ to effect polyketide synthesis. Depending on the host for the recombinant 
production of the module or an entire open reading frame, or combination of open reading 
frames, suitable control sequences such as promoters, termination sequences, enhancers, 
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and the like are ligated to the nucleotide sequence encoding the desired protein. Suitable 
control sequences for a variety of hosts are well known in the art. 

[0081] The availability of these nucleotide sequences expands the possibility for the 
production of novel polyketides and their corresponding antibiotics using host cells 
5 modified to contain suitable expression systems for the appropriate enzymes. By 

manipulating the various activity-encoding regions of a donor PKS by replacing them into 
a scaffold of a different PKS or by forming hybrids instead of or in addition to such 
replacements or other mutagenizing alterations, a wide variety of polyketides and 
corresponding antibiotics may be obtained. These techniques are described, for example, 
10 in U.S. Serial No. 09/073,538 filed 6 May 1998 and incorporated herein by reference. 

[0082] A polyketide synthase may be obtained that produces a novel polyketide by, for 
example, using the scaffolding encoded by all or the portion employed of a natural synthase 
gene. The synthase will contain at least one module that is functional, preferably two or 
three modules, and more preferably four or more modules and contains mutations, 
1 5 deletions, or replacements of one or more of the activities of these functional modules so 
that the nature of the resulting polyketide is altered. This description applies both at the 
protein and genetic levels. Particularly preferred embodiments include those wherein a KS, 
AT, KR, DH or ER has been deleted or replaced by a version of the activity from a 
different PKS or from another location within the same PKS. Also preferred are 
20 derivatives where at least one noncondensation cycle enzymatic activity (KR, DH or ER) 
has been deleted or wherein any of these activities has been mutated so as to change the 
ultimate polyketide synthesized. 

[0083] Thus, in order to obtain nucleotide sequences encoding a variety of derivatives 
of the naturally occurring PKS, and a variety of polyketides, a desired number of constructs 
25 can be obtained by "mixing and matching'* enzymatic activity-encoding portions, and 
mutations can be introduced into the native host PKS gene cluster or portions thereof 

[0084] Mutations can be made to the native sequences using conventional techniques. 

The substrates for mutation can be an entire cluster of genes or only one or two of them; the 

substrate for mutation may also be portions of one or more of these genes. Techniques for 

30 mutation include preparing synthetic oligonucleotides including the mutations and inserting 

the mutated sequence into the gene encoding a PKS subunit using restriction endonuclease 

digestion (See, e.g., Kunkel, T. A., Proc. Natl Acad. Sci. USA (1985) 82:448; Geisselsoder, 

et al, BioTechniques (1987) 5:786) or by a variety of other art-known methods. 
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(0085] Random mutagenesis of selected portions of the nucleotide sequences encoding 
enzymatic activities can also be accomplished by several different techniques known in the 
art, e.g., by inserting an oligonucleotide linker randomly into a plasmid, by irradiation with 
X-rays or ultraviolet light, by incorporating incorrect nucleotides during in vitro DNA 
5 synthesis, by error-prone PCR mutagenesis, by preparing synthetic mutants or by damaging 
plasmid DNA in vitro with chemicals. 

[0086] In addition to providing mutated forms of regions encoding enzymatic activity, 
regions encoding corresponding activities from different PKS synthases or from different 
locations in the same PKS synthase can be recovered, for example, using PCR techniques 

10 with appropriate primers. By "corresponding" activity encoding regions is meant those 
regions encoding the same general type of activity - e.g., a ketoreductase activity in one 
location of a gene cluster would "correspond" to a ketoreductase-encoding activity in 
another location in the gene cluster or in a different gene cluster; similarly, a complete 
reductase cycle could be considered corresponding - e.g., KR/DH/ER would correspond to 

15 KR alone. 

[0087] If replacement of a particular target region in a host polyketide synthase is to be 
made, this replacement can be conducted in vitro using suitable restriction enzymes or can 
be effected in vivo using recombinant techniques involving homologous sequences framing 
the replacement gene in a donor plasmid and a receptor region in a recipient plasmid. Such 

20 systems, advantageously involving plasmids of differing temperature sensitivities are 
- described, for example, in PCT application WO 96/40968. 

[0088] Finally, polyketide synthase genes, like DNA sequences in general, in addition 
to the methods for systematic alteration and random mutagenesis outlined above, can be 
modified by the technique of "gene shuffling" as described in U.S. patent 5,834,458, 

25 assigned to Maxygen, and U.S. patents 5,830,721, 5,81 1,238 and 5.605,793, assigned to 
Affymax. In this technique, DNA sequences encoding bPKS are cut with restriction 
enzymes, amplified, and then re-ligated. This results in a mixture of rearranged genes 
which can be assessed for their ability to produce polyketides. The ability to produce 
polyketides in easily transformed hosts, such as E. coli, makes this a practical approach. 

30 [0089] There are five degrees of freedom for constructing a polyketide synthase in 

terms of the polyketide that will be produced. First, the polyketide chain length will be 

determined by the number of modules in the PKS. Second, the nature of the carbon 

skeleton of the PKS will be determined by the specificities of the acyl transferases which 
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determine the nature of the extender units at each position — e.g., malonyl, methyl malonyl, 
or ethyl malonyl, etc. Third, the loading domain specificity will also have an effect on the 
resulting carbon skeleton of the polyketide. Thus, the loading domain may use a different 
starter unit, such as acetyl, propionyl, butyryl and the like. Fourth, the oxidation state at 
5 various positions of the polyketide will be determined by the dehydratase and reductase 
portions of the modules. This will determine the presence and location of ketone, alcohol, 
double bonds or single bonds in the polyketide. Finally, the stereochemistry of the 
resulting polyketide is a function of three aspects of the synthase. The first aspect is related 
to the AT/KS specificity associated with substituted malonyls as extender units, which 

10 affects stereochemistry only when the reductive cycle is missing or when it contains only a 
ketoreductase since the dehydratase would abolish chirality. Second, the specificity of the 
ketoreductase will determine the chirality of any P-OH. Finally, the enoyl reductase 
specificity for substituted malonyls as extender units will influence the result when there is 
a complete KR/DH/ER available. 

15 [0090] One useful approach is to modify the KS activity in module 1 which results in 

the ability to incorporate alternative starter units as well as module 1 extended units. This 
approach was illustrated in PCT application US/96/1 1317, incorporated herein by 
reference, wherein the KS-I activity was inactivated through mutation. Polyketide 
synthesis is then initiated by feeding chemically synthesized analogs of module 1 diketide 

20 products. The methods of the invention can then be used to provide enhanced amount of 
extender units. 

[0091] Modular PKSs have relaxed specificity for their starter units (Kao, et al, 
Science (1994), supra). Modular PKSs also exhibit considerable variety with regard to the 
choice of extender units in each condensation cycle. The degree of p-ketoreduction 

25 following a condensation reaction has also been shown to be altered by genetic 

manipulation (Donadio, et al, Science (1991), supra; Donadio, S. et al, Proc. Natl. Acad. 
Set USA (1993) 90:71 19-7123). Likewise, the size of the polyketide product can be varied 
by designing mutants with the appropriate number of modules (Kao, C. M., et aL, J. Am. 
Chem. Soc. (1994) 116:11612-11613). Lastly, these enzymes are particularly well-known 

30 for generating an impressive range of asymmetric centers in their products in a highly 

controlled manner. The polyketides and antibiotics produced by the methods of the present 
invention are typically single stereoisomeric forms. Although the compounds of the 
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invention can occur as mixtures of stereoisomers, it is more practical to generate individual 
stereoisomers using the PKS systems. 

[0092] The polyketide products of the PKS may be further modified, typically by 
hydroxylation, oxidation and/or glycosylation, in order to exhibit antibiotic activity. 
5 [0093] Methods for glycosylating the polyketides are generally known in the art; the 
glycosylation may be effected intracellular^ by providing the appropriate glycosylation 
enzymes or may be effected in vitro using chemical synthetic means as described in U.S. 
Serial No. 09/073,538 incorporated herein by reference. 

[0094] The antibiotic modular polyketides may contain any of a number of different 

10 sugars, although D-desosamine, or a close analog thereof, is most common. For example, 
erythromycin, picromycin, narbomycin and methymycin contain desosamine. 
Erythromycin also contains L-cladinose (3-O-methyl mycarose). Tylosin contains 
mycaminose (4-hydroxy desosamine), mycarose and 6-deoxy-D-allose. 2-acetyl-l- 
bromodesosamine has been used as a donor to glycosylate polyketides by Masamune, et al t 

15 J. Am. Chem. Soc. (1975) 97:3512, 3513. Other, apparently more stable, donors include 
glycosyl fluorides, thioglycosides, and trichloroacetimidates; Woodward, R. B., et al., 
J. Am. Chem. Soc. (1981) 103:3215; Martin, S. F., etal. Am, Chem. Soc. (1997) 119:3193; 
Toshima, K., et al. J. Am. Chem. Soc. (1995) 1 17:3717; Matsumoto, T., et al, Tetrahedron 
Lett (1988) 29:3575. Glycosylation can also be effected using the macrolides as starting 

20 materials and using mutants of S. erythraea that are unable to synthesize the macrolides to 
make the conversion. 

[0095] In general, the approaches to effecting glycosylation mirror those described 
above with respect to hydroxylation. The purified enzymes, isolated from native sources or 
recombinantly produced may be used in vitro. Alternatively, glycosylation may be effected 

25 intracellularly using endogenous or recombinantly produced intracellular glycosylases. In 
addition, synthetic chemical methods may be employed. 

[0096] If the hosts ordinarily produce polyketides, it may be desirable to modify them 
so as to prevent the production of endogenous polyketides by these hosts. Such hosts have 
been described, for example, in U.S. Patent No. 5,672,491, incorporated herein by 

30 reference, which describes S. coelicolor CH999 used in the examples below. In such hosts, 

it may not be necessary to provide enzymatic activity for posttranslational modification of 

the enzymes that make up the recombinantly produced polyketide synthase; these hosts 

generally contain suitable enzymes, designated holo-ACP synthases, for providing a 
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pantetheinyl residue needed for functionality of the synthase. However, in hosts such as 
yeasts, plants, or mammalian cells which ordinarily do not produce polyketides, it may be 
necessary to provide, also typically by recombinant means, suitable holo-ACP synthases to 
convert the recombinantly produced PKS to functionality. Provision of such enzymes is 
5 described, for example, in PCT application WO 98/27203, incorporated herein by 
reference. 

[0097) Again, depending on the host, and on the nature of the product desired, it may 
be necessary to provide "tailoring enzymes" or genes encoding them, wherein these 
tailoring enzymes modify the macrolides produced by oxidation, hydroxylation, 
10 glycosylation, and the like. 

[0100] The encoding nucleotide sequences are operably linked to promoters, enhancers, 
and/or termination sequences which operate to effect expression of the encoding nucleotide 
sequence in host cells compatible with these sequences; host cells modified to contain these 
sequences either as extrachromosomal elements or vectors or integrated into the 
1 5 chromosome, and methods to produce PKS and post-PKS enzymes as well as polyketides 
and antibiotics using these modified host cells. Multiple vector systems for use in 
organisms such as E. coli are contemplated. 

[0101] The vectors used to perform the various operations to replace the enzymatic 
activity in the host PKS genes or to support mutations in these regions of the host PKS 
20 genes may be chosen to contain control sequences operably linked to the resulting coding 
sequences in a manner that expression of the coding sequences may be effected in a 
appropriate host. However, simple cloning vectors may be used as well. 

[0102] Particularly useful control sequences are those which themselves, or using 
suitable regulatory systems, activate expression during transition from growth to stationary 
25 phase in the vegetative mycelium. The system contained in the illustrative plasmid pRM5, 
Le. 9 the actVactTW promoter pair and the acfII-ORF4, an activator gene, is particularly 
preferred. Particularly preferred hosts are those which lack their own means for producing 
polyketides so that a cleaner result is obtained. Illustrative host cells of this type include 
the modified S. coelicolor CH999 culture described in PCT application WO 96/40968 and 
30 similar strains of S. lividans. 

[0103] Methods for introducing the recombinant vectors of the present invention into 
suitable hosts are known to those of skill in the art and typically include the use of CaCk or 
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other agents, such as divalent cations, lipofection, DMSO, protoplast transformation and 
electroporation. 

[0104] As disclosed in Serial No. 08/989,332 filed 1 1 December 1997, incorporated 
herein by reference, a wide variety of hosts can be used, even though some hosts natively 
5 do not contain the appropriate post-translational mechanisms to activate the acyl carrier 
proteins of the synthases. These hosts can be modified with the appropriate recombinant 
enzymes to effect these modifications. 

[0105] To demonstrate the power of engineering modular polyketide synthases in a 
new heterologous system, we attempted to construct a derivative of DEBS in which a PKS 

10 module was fused to a nonribosomal peptide synthetase (NRPS)-like module 

(Mootz, H. D., et aU Curr. Opin. Chem. Biol (1997) 1:543). The first module of the 
rifamycin synthetase has recently been shown to be an NRPS-like module comprised of 
two domains: an adenylation (A) domain and a thiolation (T) domain (Admiraal, S. J., 
et al., Biochemistry Submitted). The A domain activates 3-amino-5-hydroxybenzoate (as 

15 well as benzoate and several benzoate derivatives, (Admiraal, supra)) in an ATP-dependent 
reaction, and transfers the aryl adenylate onto the phosphopantetheine arm of the T domain 
(Figure 7). This NRPS-like module was fused upstream of the first condensation module 
of DEBS in lieu of the loading didomain of DEBS (The construction of plasmid pBP165, 
carrying the rifamycin loading didomain fused to DEBS1 as well as the pccAB genes, is 

20 described in Example 11.) In the presence of exogenous propionate and benzoate, the 

resulting strain of E. coli produced the expected 6dEB analog (compound 3), as confirmed 
by NMR and mass spectrometry (Figure 7) ( 13 C-NMR (CDC1 3 , 500 MHz) 5 213.76, 
177.43, 79.70, 76.60, 71.24, 37.72 (enriched carbon atoms only). Mass Spectrometry 
(AP-CI) for expected ^Q^CeHssOeNa: 463.2757; observed: 463.2847). 

25 [0106] In summary, we have demonstrated the feasibility of engineering E. coli to 
produce complex polyketide natural products. Multiple changes were made to the E. coli 
genome for relevant 6dEB production, including introduction of the three DEBS genes 
from Saccharopolyspora erythraea, introduction of the sfp phosphopantetheinyl 
transferase gene from Bacillus subtilis, introduction of genes encoding a heterodimeric 

30 propionyl-CoA carboxylase from Streptomyces coelicolor, deletion of the endogenous 

prpRBCD genes, and overexpression of the endogenous prpE and birA genes. When gene 

expression was coordinately induced at low temperature, propionate could be converted 

into 6dEB by this metabolically engineered cellular catalyst with excellent kinetic 
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parameters. Given the availability of well-established scalable protocols for fermenting E. 
coli to overproduce bioproducts, the ability to synthesize complex polyketides in this 
heterologous host bodes well for the practical production of these bioactive natural 
products. Equally important, as indicated by the hybrid PKS-NRPS described here, it 
5 opens the door for harnessing the enormous power of molecular biology in E. coli to 
engineer modular polyketide synthases using directed and random approaches. As such, 
organisms such as E. coli that make a hybrid modular polyketide synthases such as one that 
comprises NRPS and incorporate a variety of benzoate substrates are also contemplated. 

Starting Material Enhancement and Variation 

1 0 [0107] Thus, proteins (and their encoding sequences) wherein the proteins catalyze the 
production of starter and/or extender units can be used to enhance the production of 
polyketides by providing a considerable variety of these starter and extender units at higher 
levels than would ordinarily be produced. As set forth above, the required enzymes used to 
modify the microbial hosts, including those which are effective in providing for the 

1 5 production of, or enhancing the production of, starter and/or extender units may be supplied 
as extrachromosomal replicating elements, but are preferably introduced into the host 
genome, for example, by techniques of homologous recombination. By integrating the 
desired expression systems into the genome, the need for selection pressure is avoided and 
suitable production strains are obtained which are stable and can be used under ideal 

20 production conditions without additional antibiotics. In addition, the host may contain 
silent genes which are useful in the production of substrates that are activated by 
recombinant techniques. 

[0108] Because the proteins catalyze reactions to obtain starter and/or extender units 
using a variety of substrates, they are versatile tools in enhancing the availability of starter 

25 and extender units for a wide variety of PKS, whether modified or unmodified. As stated 
above, particularly useful are the products of the mat ABC operon (or analogous operons in 
other species) and the propionic carboxylase encoded by the pccB and accA2 along with 
accAl genes (or their analogs in other species). These enzymes and their encoding 
sequences are useful in view of the discovery that the mat ABC operon and the propionic 

30 carboxylase-encoding genes provide enzymes which not only carry out the required 

reactions on a variety of substances, but also do so with the production of products with the 
stereochemistry required for use in polyketide synthesis. 
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[0109J The ability of the genes described herein to provide appropriate starter and 
extender units was established as described below. 

Improved Culture Conditions 

[0110] In addition to the modification of the hosts described above, the culture 
5 conditions of the microbe may be modified to improve the yields of polyketide synthesized. 
The success of these conditions can be measured by the levels of, or rates of, polyketides 
produced by at least 5%, preferably 10%, more preferably 25%, and most preferably at 
least by 50%. Enhanced production levels of 200% may also be achieved. These 
enhancements are perhaps relatively modest, and it has been possible by modifying the 

1 0 genomic complement of the host organism or by manipulating the culture conditions or 
both to enhance the levels 5 fold, 10 fold, 20 fold, 100 fold, 200 fold or 500 fold. The 
production levels or rate of production are measured under the set of conditions where 
everything else is held constant other than the parameter being varied. 

[0111] As shown below, the yield of polyketide is markedly increased by 

15 (1) maintaining relatively steady nutrient levels throughout the fermentation; (2) providing 
an auxiliary thioesterase; (3) batch feeding additional precursor for starter and/or extender 
units; and (4) permitting growth to high cell densities. Using one or more of these 
strategies, the yield of polyketide will be enhanced by at least 1.25 fold, preferably 1.5 fold, 
more preferably 2 fold, and in many instances 5 fold, 10 fold, 25 fold, 50 fold, 100 fold, 

20 200 fold or 500 fold. 

[0112] As shown in Example 12, addition of precursor after the initial concentration is 
depleted is beneficial, as is providing an expression system for thioesterase II (TEII). 
Especially beneficial is maintaining nutrient levels at a relatively constant value during the 
course of polyketide synthesis and permitting cell growth to high cell densities. 

25 [0113] By "relatively constant value" of nutrients is meant maintaining nutrients at low 
concentration, typically below 2 g/L in the case of E. coli. By "high cell densities" is 
meant OD 6 oo values of about 40-80, preferably 50-70 for E. coli. 
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Example 1 

Production of Malonyl CoA and 2S-Methvlmalonyl CoA Using the CoA Synthetase 

[0114J & c °ti strain L8 has a temperature-sensitive mutation in the acetyl-CoA 
carboxylase gene such that malonyl-OoA cannot be produced from acetyl-CoA at 37*C. 
5 However, the gene product is able to effect this conversion at 30°C. See Harder, M.E., 
et al f Proc. Natl. Acad. Sci. (1972) 69:3105-3109. Since acetyl-CoA carboxylase 
conversion of acetyl-CoA into malonyl-CoA is the only known route for malonyl-CoA 
production in E. coli, and since malonyl-CoA is essential for fatty acid biosynthesis, this 
mutant strain of E. coli can grow at 30*C, but not at 37°C. A transformant of L8 carrying 

10 the mat ABC operon is produced by transforming with the plasmid pMATOP2 which 
contains the matA, matB and matC genes under control of their native promoter and is 
described in An, J.H., et al t Eur. J. Biochem. (1998) 257:395-402. This transformant is 
still unable to grow at 37 a C in the absence of malonic acid; however, addition of 1-5 mM 
malonic acid to the medium permits it to grow at this temperature. (In the absence of the 

15 plasmid, malonic acid is unable to support growth at 37°C.) The concentration of the 

extracellular malonic acid is important, however, as increasing the concentration to 40 mM 
results in an absence of growth, possibly due to a metabolic imbalance caused by 
overproduction of malonyl CoA in comparison to the amount of coenzyme A available. 
Lethality was also induced in XLl-Blue (a wild-type strain of E. coli) in the presence of the 

20 plasmid carrying the mat ABC operon and high concentrations of methylmalonic acid. 

[0115] Nevertheless, the results set forth above demonstrate that the protein encoded by 
matB produces malonyl-CoA in vivo under physiological conditions as long as free malonic 
acid is available; and transported into the cells by the protein encoded by matC. Thus, the 
matBC genes can be used to supplement malonyl-CoA availability in an E. coli cell in 

25 which complex polyketides are to be produced by feeding malonic acid. 

[0116] In addition to converting malonic acid into malonyl-CoA, MatB has also been 
shown to convert methylmalonic acid into methylmalonyl-CoA. However the 
stereochemistry of the resulting product has not been reported. This is important, because 
modular polyketide synthases are known to only accept one isomer of methylmalonyl-CoA, 

30 namely 25-methylmaIonyl-CoA (Marsden, A. F., et al, Science (1 994) 263 :378-380). To 
investigate whether MatB can make the correct isomer of methylmalonyl-CoA, construct 
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encoding a glutathione-S-transferase fusion (GST-MatB) was used to produce this protein. 
See An, J. H., et al t Biochem. 1 (1999) 344:159-166. The GST-MatB protein was purified 
according to standard protocols as described and mixed with (module 6+TE) of the 
erythromycin polyketide synthase, also expressed in E. coli as described by Gokhale, R. S., 
5 et aL, Science (1999) 284:482-485. 

[0117] In earlier studies, Applicants have established the activity of (module 6+TE) by 
demonstrating its ability to catalyze the following reaction in vitro. 

[0118] N-acetylcysteamine thioester of (2S, 3R)-2-methyl-3-hydroxy-pentanoic acid + 
2 (RS)-methylmalonyl-CoA + NADPH -> (2R,3S,4S,5R)- 2,4-dimethyl-3,5-dihydroxy-n- 
10 heptanoic acid 8-lactone +NADP + . 

[0119] The methylmalonic thioester product obtained using methylmalonic acid as the 
substrate for GST-MatB provides the correct stereochemistry to serve as the source of the 
extender unit in this reaction. More specifically, to generate the substrate for the above 
polyketide synthesis in situ, the following reaction mixture (containing 6+TE and 
1 5 GST-MatB) was prepared in a reaction buffer of 1 00 mM Na Phosphate (pH7) buffer, 
1 mM ethylenediaminetetraacetic acid (EDTA), 2.5 mM dithiothreitol (DTT) and 20% 
glycerol: 

40 mM methylmalonic acid (pH 6) 
16.6mMMgCl 2 
20 5 mM ATP 

5 mM CoASH 
13.3 mM NADPH 

1 mM N-acetylcysteamine thioester of (2S, 3R)-2-methyl-3-hydroxypentanoic acid 
(prepared in radioactive form). 

25 

[0120] After 4 hrs, the reaction was quenched and extracted with ethyl acetate 
(extracted twice with three times the reaction volume). The samples were dried in vacuo 
and subjected to thin layer chromatography analysis. 

[0121] A positive control was performed under identical conditions to those described 

30 earlier - i.e., conditions wherein (RS)-methylmalonyl-CoA was substituted for the 

combination of methylmalonic acid, MgCl 2 , ATP, CoA SH, and GST-MatB. A negative 

control included all of the components listed above except for the GST-MatB fusion 

protein. The results demonstrated that the two-enzyme system described above is able to 
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produce the expected product in quantities comparable to the positive control reaction. 
This confirms that MatB synthesizes the correct isomer of methylmalonyl-CoA. 

[0122] Thus, MatB/MatC is useful to synthesize both malonyl-CoA and 
2S-methylmalonyl-CoA in vivo for polyketide biosynthesis. This is the first instance of 
5 engineering E. coli with the ability to produce 2S-methylmalonyl-CoA in vivo under 
physiological conditions. Moreover, co-expression of mat A in vivo should allow 
conversion of methylmalonyl-CoA into propionyl-CoA, thereby supplementing available 
sources of this starter unit. 

Example 2 

10 Ability of Propionvl Co A Carboxylase to Generate 2S-Methvlmalonvl Co A 

[0123] To utilize the propionyl carboxylase gene from S. coelicolor described above, 
an £. coli expression host (BL-21 (DE3)) was prepared using the method developed by 
Hamilton, C. M., et al y 1 Bacteriol. (1989) 171 :4617-4622. The new strain (BAP1) 
contains a phosphopantethiene-transferase gene (the sfp gene) from Bacillus subtilis 

1 5 integrated into the prp operon of E. coll The T7 promoter drives sfp expression. In the 
recombination procedure, the prpE gene was also placed under control of the T7 promoter, 
but the rest of the operon was removed. This genetic alteration would ideally provide three 
features: 1) the expression of the sfp protein needed for post-translational modification of 
the DEBS and potentially other polyketide synthases (PKSs); 2) the expression of the prpE 

20 protein, a putative propionyl-CoA synthetase theoretically capable of ligating Co ASH to 
propionate; and 3) a cellular environment that is no longer able to metabolize propionyl- 
CoA as a carbon/energy source. 

[0124] First, it was verified that the BAP1 strain, by virtue of its production of the 
product of the sfp gene was able to effect phosphopantetheinylation of a PKS produced in 

25 these cells. BAP1 was transfected with a plasmid comprising an expression system for 
module 6+TE and the activity of the module produced was compared to the activity of the 
module produced recombinantly in BL-21 (DE3) cells where the sfp gene was plasmid 
borne. These levels were comparable. In contrast, when expressed alone in BL-21 (DE3), 
module 6+TE showed no activity. Additionally, BAP1 was confirmed for its inability to 

30 grow on propionate as a sole carbon source (a property exhibited by E. coli strains such as 
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BL21 (DE3)). BAP1 is a preferred host for the heterologous expression of polyketide 
synthases in conjunction with enzymes such as MatBC and propionyl-CoA carboxylase. 

[0125] The propionyl-CoA carboxylase enzyme was expressed in £. coli under the T7 
promoter. The product enzyme was able to supply substrate for module 6+TE in vitro, 
5 This was demonstrated using the coupling of the methylmalonyl-CoA thioester product of 
the propionyl CoA carboxylase enzyme to the N-acetyl cysteamine thioester of 
(2S,2R)2-methyl-3-hydroxypentanoic acid. The pccB and accA2 genes described above 
which encode the components of the propionyl-CoA carboxylase, were expressed and the 
products individually purified according to standard procedures. Initially, the pccB and 
10 accA2 subunits were allowed to complex on ice in 150 mM phosphate (pH7) and 300 ng 
BSA. After 1 hour, the following substrates were added to a volume of 100 ^il and 
incubated for an additional 30 minutes at 30°C: 

1 mM propionyl-Co A 

50 mM sodium bicarbonate 
15 3mM ATP 

5mMMgCl 2 

[0126] Module 6+TE was then added with the following final set of reagents to give 
200 jil total and allowed to react for an additional hour at 30°C: 
20 10% glycerol 

1.25 mM DTT 
0.5 mM EDTA 
4 mM NADPH 

2 mM N-acetylcysteamine thioester of (2S, 3R)-2-methyl-3-hydroxypentanoic acid 
25 (prepared in radioactive form). 

[0127] The reaction was quenched and extracted as described above, and showed 
formation of expected product. A positive control included racemic malonyl-CoA. When 
either ATP or sodium bicarbonate was removed from the reaction, no product was formed. 
30 The propionyl-CoA carboxylase thus produces a substrate suitable for polyketide synthase 
activity. This is particularly useful for polyketide production, especially in conjunction 
with the new expression host mentioned above, BAP1. 
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J0128] The DEBS protein DEBS1+TE is produced by pRSG32. DEBS1 shows the 
weakest expression of the three DEBS proteins and, until recently, the enzyme showed no 
in vitro activity. However, by growing E. coli containing pRSG32 in M9 minimal medium, 
and inducing protein expression at 22°C, DEBS1+TE activity is now reproducibly 
observed. 

[0129] Plasmids pRSG32 (DEBS1+TE) and pl32 (a plasmid containing the a and p 
components of propionyl-CoA carboxylase) were cotransfected into BAP1. Cultures of 
10 ml M9 minimal media were grown to mid-log phase levels and concentrated to 1 ml for 
induction with IPTG and the addition of 0.267 mM 14 C-propionate. The samples were then 
incubated at 22°C for 12-15 hours. The culture supernatant was then extracted with ethyl 
acetate for analytical TLC. A product ran with the expected positive control and this same 
product was undetectable when using either wild type BL-21 (DE3) or removing pl32. 
thus, the carboxylase forms the correct stereoisomer. 

[0130] In addition, 100 ml cultures of M9 minimal media containing BAP1 
transformed with pRSG32, p!32, and pCY214 (a biotin ligase included to aid biotin's 
attachment to the a subunit of the propionyl-CoA carboxylase) were grown to mid-log 
phase for induction with IPTG and the addition of 100 mg/L l3 C-propionate. The activity 
of the biotinylated subunit (pccA) could be significantly enhanced upon co-expression of 
the E. coli birA biotin ligase gene. Upon extraction of the culture supernatant and 
concentration of the sample, 13 C-NMR confirmed the location of the expected enriched 
product peaks. A subsequent negative control using BL-21 (DE3) failed to yield the same 
spectrum. In addition to demonstrating the ability of E. coli to make complex polyketides 
in vivo, these results also suggest that the prpE protein programmed to express in BAP1 is 
active. 

[0131] Alternatively, M9 minimal media cultures of transformed cells were grown at 
37°C to mid-log phase, followed by induction at 22°C with 0.5-1 mM IPTG, 2.5 g/L 
arabinose, and 26 mg/L or 250 mg/L [1- ,4 C]- or [1- 13 C] -propionate, respectively. 
Regarding the l4 C-l -propionate feeding, individual trahsformants were inoculated into M9 
minimal media cultures with glucose (Maniatis, T., et al. y Molecular Cloning: A Laboratory 
Manual. 1982) in the presence of 50 ng/ml carbenicillin, 25 ng/ml kanamycin, and 
17 ng/ml chloramphenicol at 37°C and 250 rpm. Cultures were grown to mid-log phase 
(OD 6 oo=0.6-0.8), cooled at 22°C for 5 min, and then centrifoged. The cell pellets were 
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resuspended in 1 ml of the remaining supernatant and induced with 1 mM IPTG and 0.25% 
arabinose (for pCY216). In addition, regarding the ,4 C-1 -propionate (at 56 mCi/mmol) was 
added at final concentration of 0.27 mM. The culture was then stirred for an additional 
12-1 5 hrs at 22°C. At this point the culture was centrifuged and 100 p.1 of the supernatant 
5 was extracted (2x) with ethyl acetate (300 each time). The extract was dried in vacuo 
and subjected to TLC analysis. Negative controls included cultures of 
BAPl/pRSG32/pCY216 and BL2ipE3)/pRSG32/pTR132/pCY216. 

[0132] Regarding the ,3 C-1 -propionate feeding, a single transformant of 
BAPl/pRSG32/pTR132/pCY216 was used to start a 3 mLLB culture with 100 \ig/m\ 

10 carbenicillin, 50 |ig/ml kanamycin, and 34 ng/ml chloramphenicol at 37°C and 250 rpm. 
The starter culture was used to inoculate 100 mL M9 minimal media with glucose at the 
same antibiotic concentrations as above. These cultures were grown at 250 rpm and 37°C 
to mid-log phase (OD 6 oo=0.5-0.7), cooled for 15 minutes in a 22°C bath, and induced with 
500 nM IPTG and 0.25% arabinose. 13 C-1 -propionate was added at 100 mg/L and the 

15 cultures were incubated at 22°C for 12-15 hrs. The sample was then centrifuged and the 
supernatant extracted twice with 300 ml ethyl acetate. The sample was dried in vacuo, 
resuspended in CDCb, and analyzed via 13 C-NMR. A negative control was performed with 
BL21(DE3)/pRSG32/pTR132/pCY216. After 12-48 hours the culture supernatant was 
extracted and analyzed for formation of the expected triketide lactone (Figure 7, compound 

20 2) product of DEBS1+TE. Formation of triketide lactone under both feeding conditions 
confirmed the ability of BAP 1 to produce polyketides. 

[0133] Construction ofplasmids pRSG32 t pBP49 f pRSGSO: Genes encoding 
DEBS1+TE (pRSG32), DEBS2 (pBP49) and DEBS3 (pRSGSO) were cloned into pET21c 
(Novagen). The DEBS 1+TE gene was cloned as the Ndel-EcoRl fragment from 

25 pCK12 (6). The DEBS3 gene was cloned as the Ndel-EcoRl fragment from pJRJIO 

(Jacobsen, J. R., et al, Biochemistry (1998) 37:4928). To express the DEBS2 gene, the 
Bsml-EcoRl fragment from pRSG34 (Gokhale, R. S., et al, Science (1999) 284:482), 
which has been used previously to express module 3+TE, was replaced with a Bsml-EcoRl 
fragment encoding module 4. The EcoRI site (in bold) was engineered immediately 

30 upstream of the stop codon of the DEBS2 gene by modifying the natural sequence to the 

following: CGGGGGAGAGGACCTGAATTC 

[0134] It should be noted that a first attempt was made to express the genes encoding 

each of the three DEBS proteins, followed by in vitro assays of protein activity. DEBS3, 
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DEBS2 and a variant of DEBS I , DEBS1+TE were cloned individually into the pET21c 
expression vector and introduced via transformation into E. coli BL21(DE3) harboring the 
sfp phosphopantetheinyl transferase gene on pRSG56 (Kao, C. M., et al., J. Am. Chem. Soc. 
(1995) 1 17:9105-9106, Cortes, J., et al. y Science (1995) 268:1487-1489; Lambalot, R. H., 
5 et al y Chemistry & Biology (1996) 3:923-936; Gokhale, R. S., et al., Science (1999) 
284:482-485). The expression levels of the three DEBS genes were found to be 
comparable to those reported earlier from S. erythraea (Caffrey, P., et al, FEBS Letters 
(1992) 304:225-228) or S coelicolor (Pieper, R., et al. 9 Nature (1995) 378:263-266). 
Individual transformants were used to start 25 ml LB seed cultures containing 100 ng/ml 

10 carbenicillin and 50 jxg/ml kanamycin at 250 rpm and 37°C. These cultures were used to 
inoculate 1 L of LB medium, and the culture was grown under the same conditions. At 
mid-log phase (ODm=0A-0.S) cells were induced with 1 mM IPTG and transferred to a 
30°C incubator. Cells were harvested after 4-6 hours and their protein content was 
analyzed via 7.5% SDS-PAGE. The three DEBS proteins were expressed at ca. 1% total 

15 cellular protein. However, although DEBS3 was found to be active in these lysates, 

DEBS1+TE and DEBS2 lacked any detectable activity (DEBS1+TE (Pieper, R., supra) and 
DEBS3 were assayed as described earlier (Jacobsen, J. R., et al, Biochemistry (1998) 
37:4928). Although an assay for the entire DEBS2 has not yet been developed, the activity 
of module 3 on this protein can be assayed as described earlier (Gokhale, R. S., supra). 

20 Consistent with these results, recombinant DEBS3 could be purified from these lysates 
using procedures described earlier, (Pohl, N. L., et al., J. Am. Chem. Soc. (1998) 
120:1 1206-1 1207), but neither DEBS1+TE nor DEBS2 could be purified in detectable 
quantities. The key parameter that facilitated detection of in vitro activity and subsequent 
purification of DEBS1+TE and DEBS2 was the incubation temperature following IPTG 

25 (isopropylthio-P-D-galactoside) induction. Upon lowering the expression temperature 

from 30°C to 22°C, active DEBS1+TE, DEBS2, and DEBS3 proteins could be detected in 
recombinant E. coli lysates. Hereafter, low temperature induction conditions were 
maintained throughout the course of this study. 

[0135] The use of low temperatures in the favorable expression of large genes and 

30 proteins in E. coli suggests that other large genes and proteins can be expressed in E. coli as 
well as other organisms by beneficially using low temperatures as shown herein. 
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Example 3 

Enhanced Production of 6-dEB in S. coelicolor 

(0136) The presence of the math and matC genes was also able to enhance the 
recombinant production of 6-dEB in S. coelicolor which had been recombinantly modified 
5 to produce this polyketide by insertion of the DEBS gene complex on the vector pCK7. 
The maiB and matC genes were expressed in a recombinant strain of Streptomyces 
coelicolor that produces 50 mg/L 6-deoxyerythronolide B by virtue of plasmid borne 
DEBS genes. The matB and matC genes were inserted immediately downstream of DEBS 
genes on pCK7. 

1 0 [0137] In more detail, the source of the matBC genes is pFL482, a derivative of 

PCR-Blunt (Invitrogen) containing a 5 kb BglWHirtdUl fragment from pMATOP2 which 
carries the matBC genes. The Nsil fragment of pFL482 containing the matBC genes was 
cloned into the unique Nsil site of pCK7 in the same direction as the DEBS genes to yield 
pFL494. Upon transformation of plasmid pFL494 into S. coelicolor CH999, macrolide 

1 5 titer increases of 1 00-300% were obtained in the presence of exogenous 
methylmalonate (0.1-1 g/L). 

[0138] Cultures of S. coelicolor CH999 with or without plasmid pCK7 or pFL494 were 
grown in flasks using R6 medium (sucrose, 103 g/L; K2SO4, 0.25 g/L; MgCl 2 -6H 2 0, 
10.12 g/L; sodium propionate, 0.96 g/L; casamino acids (Difco), 0.1 g/L; trace elements 

20 solution, 2 mL/L; yeast extract (Fisher), 5 g/L; pH 7) supplemented with bis-tris propane 
buffer (28.2 g/L). Trace elements solution contained ZnCl 2 , 40 mg/L; FeCl 3 -6H 2 0, 
200 mg/L; CuCl 2 -2H 2 0, 10 mg/L; MnCl 2 -4H 2 0, 10 mg/L; Na 2 B 4 0 7 10H 2 0, 10 mg/L; 
(NH4)6Mo 7 (}24-4H 2 C>. All media were supplemented with 50 mg/L thiostrepton 
(Calbiochem) to select for plasmid-containing cells, and with 5 mL/L Antifoam B 

25 (JT Baker) for control of foam. Thiostrepton was dissolved in DMSO prior to addition to 
cultures, giving a final DMSO concentration of approximately 1 mL/L of medium. 

[0139] Seed cultures for the fermentation were prepared by inoculation of 50 mL 
medium, followed by growth for two days at 240 rpm and 30°C in 250 mL baffled flasks 
(Bellco). These seed cultures were then used to inoculate 50 mL medium in the presence 

30 or absence of 1 g/L methylmalonate in 250-mL baffled flasks at 5% of final volume. All 
flask cultures were run in duplicate and sampled daily. The entire experiment was repeated 
once to ensure batch-to-batch reproducibility of the results. 
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[0140] Quantitation of 6-dEB and 8,8a-deoxyoleandolide was carried out using a 
Hewlett-Packard 1090 HPLC equipped with an Alltec 500 evaporative light scattering 
detector. HPLC samples were first centrifuged 5 min at 12,000 x g to remove insolubles. 
The supernatant (20 \iL) was applied onto a 4.6 x 10 mm column (Inertsil, CI 8 ODS3, 
5 5 jim), washed with water (1 ml/min for 2 min), and finally eluted onto the main column 
(4.6 x 50 mm, same stationary phase and flow rate) with a 6-min gradient starting with 
100% water and ending with 100% acetonitrile. 100% acetonitrile was then maintained for 
one min. Under these conditions, 6-dEB eluted at 6.2 minutes and 8.8a-deoxyoleandolide 
at 5.8 min. Standards were prepared from 6-dEB purified from fermentation broth. 
1 0 Quantitation error was estimated to be ± 1 0%. 

(0141] As described above, S. coelicolor CH999 either containing pCK7 or containing 
pFL494 were compared for their productivity of 6-dEB and 8,8a-deoxyoleandolide. 

[0142] The results show the following: 

[0143] 1 . Cell density was substantially the same for both strains. 

1 5 [0144] 2. The production of both 6-dEB and 8,8a-deoxyoleandolide is dramatically 
enhanced in CH999/pFL494 as compared to CH999/pCK7, whether measured in terms of 
mg/liters/hour or in mg/liter as a final titer after six days. (8,8a-deoxyoleandolide is the 
same as 6-dEB except that it contains methyl instead of ethyl as position 12, since acetyl 
CoA rather than propionyl CoA is used as a starter unit.) More specifically, after six days 

20 CH999/pFL494 plus methylmalonic acid produced 1 80 mg/1 6-dEB and about 90 mg/1 of 
8,8a-deoxyoIeandolide. If methylmalonic acid was not added to the medium, 6-dEB was 
produced at a level of 130 mg/1 while 8,8a-deoxyoleandolide was produced at bout 40 mg/1. 
For CH999 modified to contain pCK7, in the presence of methylmalonic acid in the 
medium, only 60 mg/1 6-dEB were formed along with about 20 mg/1 of 

25 8,8a-deoxyoleandolide. Without methylmalonic acid, these cells produced slightly less of 
each of these macrolides. 

[0145] 3. CH999/pFL494 completely consumed methylmalonate supplied at 1 g/L by 
day 6. 

[0146] 4. Consumption of 1 g/L methylmalonate results in a cumulative increase in 
30 macrolide of 200 m/L, representing a 35% conversion efficiency of methylmalonate into 
products. 

[0147] 5. CH999/pFL494 shows improved production of both macrolides even in the 

absence of exogenous methylmalonate (see 2 above). 
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10148) 6. Even CH999/pCK7 showed a 20% improvement in 6-dEB production when 
exogenous methylmalonate was added (see 2 above). 

[0149] In addition to enhancing the productivity of known polyketides in natural and 
heterologous hosts, MatB is also used to produce novel polyketides. In contrast to other 
5 enzymes that produce the alpha-carboxylated CoA thioester building blocks for polyketide 
biosynthesis, such as methylmalonyl-CoA mutase (which has a high degree of specificity 
for succinyl-CoA) and acetyl/propionyl-CoA carboxylase (which prefers acetyl-CoA and/or 
propionyl-CoA), MatB is active with respect to a wide range of substrates. In addition to 
malonate and methylmalonate, MatB is able to activate substrates such as ethylmalonate, 

1 0 dimethylmalonate, isopropylmalonate, propylmalonate, allylmalonate, 

cyclopropylmalonate, and cyclobutylmalonate into their corresponding CoA thioesters. 

[0150] Incorporation of these substrates into polyketide synthases requires a suitable 
acyltransferase (AT) which may be engineered into the appropriate module of a polyketide 
synthase, so that it can accept the unnatural substrate. Though none of these dicarboxylic 

15 acids yield detectable quantities of novel compounds when fed to CH999/pFL494, certain 
PKS enzymes naturally possess AT domains with orthogonal substrate specificity. For 
example, the FK506 PKS contains an acyltransferase domain that ordinarily incorporates 
bulky substrates such as propylmalonyl-CoA in preference to substrates such as malonyl- 
CoA or methylmalonyl-CoA, and can thus accept MatB-generated unnatural building 

20 blocks without any PKS engineering. 

[0151] Using a protein engineering strategy described by Lau, J., et al, Biochemistry 
(1999) 38:1643-1651, the AT domain of module 6 of DEBS in pFL494 was modified to 
include the specificity determining segment from the niddamycin AT4 domain which 
incorporates ethylmalonyl-CoA. See: Kakavas, S. J., et al., X Bacterid (1997) 

25 179:7515-7522. The resulting plasmid pFL508 was transformed into CH999. Upon 
feeding this strain with ethylmalonate, mass spectroscopy was able to detect a product 
corresponding to 2-ethyl-6dEB in levels comparable to that of 6dEB. The new compound 
was undetectable in the absence of ethylmalonate or in a control strain lacking the matBC 
genes. 
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Example 4 
Production of 6-dEB in E. coli 

[0152] We have demonstrated the ability of E. coli to produce complex, complete, 
polyketides, when programmed with the ability to express a functional PKS, a 
5 pantetheinyltransferase, and one or more pathways for producing starter and extender units. 
E. coli strain BL-21(DE) obtained from Novagen was modified genetically by inserting the 
phosphopantetheinyl transferase gene (the sfp gene) from Bacillus subtilis into the 
chromosome under the control of the phage T7 promoter by deleting the prp A-D portion of 
the prp operon, thus also placing the prpE locus, which encodes a propionyl CoA 

1 0 synthetase, under control of the T7 promoter. This genetically modified strain was then 
modified to include expression systems for the three genes encoding the DEBS1, DEBS2, 
and DEBS3 proteins, also under control of the T7 promoter as well as genes encoding 
propionyl CoA carboxylase and a gene encoding biotin ligase which is necessary for 
activation of the propionyl CoA carboxylase enzyme. The resulting E. coli contains a 

1 5 complete synthase for 6-dEB, a phosphopantetheinyl transferase necessary for the 

activation of this PKS, the propionyl CoA carboxylase enzymes that supply methylmalonyl 
CoA from propionyl CoA, and an inducible means to produce the endogenous propionyl 
CoA synthase capable of converting exogenous propionate to propionyl CoA. In addition, 
the endogenous system for catabolism of propionate was disarmed. 

20 [0153] Thus, the E. coli are provided enzymes for synthesis of both starter and extender 
units under control of an inducible promoter, the endogenous mechanism for destruction of 
the propionate precursor of the starter and extender units has been disarmed; and 
expression systems (also under inducible promoters) have been provided for the necessary 
PKS proteins along with an expression system for the enzyme for activation of the PKS 

25 proteins. 

[0154] In more detail, the genetically modified BL-2 1 (DE3) strain was prepared 
according to the procedure described in Hamilton, et al, J, Bacteriol (1989) 
171 :461 7-4622, which is incorporated herein by reference. A derivative of pMAK705 
described in this publication, was prepared. In the derived vector, a T7 promoter coupled 
30 to the sfp gene was flanked by a 1 ,000 base pair sequence identical to that upstream of the 
A locus of the prp operon and a 1,000 base pair sequence identical to the sequence 
downstream of the E locus of this operon. The sfp gene was obtained from pUC8-sfp, a 
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plasmid described by Nakano, et ai, Mol Gen. Genet (1992) 232:313-321. The resulting 
integrated sequence deletes the prp loci A-D and inserts the T7 promoter controlling the sfp 
gene in their place and further results in placing the prpE locus under the control of the T7 
promoter. As suggested herein, this site was chosen for sfp gene insertion for two reasons. 
5 First, the prp operon is putatively responsible for propionate catabolism in E. coli 
(Horswill, A. R., and Escalante-Semerena, J. C, 1 Bacteriol (1999) 181:5615-5623). 
Since propionate was intended to be the sole source of carbon building blocks for 6dEB 
biosynthesis (see below), concurrent propionate catabolism and anabolism were deemed 
undesirable. By deleting prpRBCD in the process of sfp integration, the ability of BAP1 to 

1 0 utilize propionate as a carbon and energy source was eliminated. Second, together with the 
sfp gene, the prpE gene in BAP1 was also placed under control of an IPTG-inducible 
promoter such as a T7 promoter. PrpE is thought to convert propionate into propionyl-CoA 
(Horswill, A.R., and Escalante-Semerena, J. C, Microbiology (1999) 145:1381-1388); 
therefore, in the presence of exogenous propionate, propionyl-CoA can be expected to 

15 accumulate inside the cell at the same time as DEBS is expressed in an active form. It is 
noted, however, that it may not be desirable to delete prpRBCD is a production strain. It 
may be desirable in some strains, alternatively, to inactivate only some of the prpRBCD 
genes. The T7 promoter is inducible by BPTG. 

[0155] The resulting genetically altered host, designated BAP 1, was than transfected 

20 with three plasmids each selectable for a different antibiotic resistance. These plasmids are 
as follows: 

pBP130 is derived from pET21 (carb R ) obtained from Novagen and modified to 
contain the DEBS2 and DEBS3 genes under control of the T7 promoter. 

pBP144 is a modified form of pET28 (kan R ) also available from Novagen 
25 containing the pec and DEBS1 genes, also under control of the T7 promoter. 

pCY214 (cm R ) contains the E. coli birA (biotin ligase) gene under the ara promoter 

and is described in Chapman-Smith, et al t Biochem. 1 (1994) 302:881-887. This plasmid 

was obtained as a gift from Dr. Hugo Gramajo. The PCC protein and pec gene are 

described in Rodriguez, etal, Microbiol (1999) 145:3109-3119. 

30 [0156] Construction of plasmids pBPBO, pBP144: The expression vectors pET21c 

and pET28a were first re-engineered by replacing the Bpul 1021-DraIII fragments in these 

vectors with a polylinker possessing the Bpul 1021, Nsil, PstI, Pad and Dralll sites. The 

DEBS2 gene from pBP49 and the DEBS3 gene from pRSG50 were cloned into the pET21c 
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derivative between the Ndel-EcoRI and Nsil-PacI sites, respectively, yielding pBP130 
(25.5 kb). Thus, pBP130 is capable of expressing the DEBS2 and DEBS3 genes under the 
control of the same pT7 promoter. Similarly, pBP144 (20 kb) was constructed from the 
pET28a derivative described above by inserting the pccAB genes from pTR132 (Rodriguez, 
5 E., and Gramajo, H., Microbiology (1999) 145:3109-31 19) and the DEBS1 gene into the 
Ndel-EcoRI and Pstl-PacI sites, respectively. This DEBS1 gene was derived from pRSG32 
by replacing the Spel-EcoRI fragment with a fragment amplified from the 3' end of the 
natural DEBS1 gene using the following oligonucleotides: 5* oligonucleotide: 
TTACTAGTGAGCTCGGCACCGAGGTCCGGGG; 3 f oligonucleotide: 

10 TTGAATTCGGATCGCCGTCGAGCTCCCGGCCGA. Thus, pBP144 expresses the 
pccAB genes and the DEBS1 gene, each under the control of its own pT7 promoter. 

[0157] For the production of 6-dEB, BAP1 cells transformed with pBP130, pBP144, 
and pCY214 were grown in M9 minimal media with the appropriate antibiotics. The 
culture was grown to mid-log phase, followed by induction with IPTG and arabinose and 

15 the concomitant addition of 250 mg/L I3 C-1 -propionate. Induced cultures were grown for 
12-24 hrs at 22°C. (Both the minimal medium and lower temperatures were found to be 
beneficial for DEBS gene expression. This protocol permitted growth-related production 
of 6-dEB, since glucose provided the carbon and energy source for general metabolism, 
while propionate was converted into 6-dEB.) 

20 [0158] After 12-24 h the culture supernatant was extracted with ethyl acetate. The 
organic phase was dried in vacuo, and re-dissolved in CDCI3 for 13 C-NMR analysis. The 
accompanying spectrum showed that 6-dEB was the major l3 C-labeled product. Other 
major ,3 C-labeled compound(s) with peaks in the range of 120-140 ppm are not derived 
from propionate incorporation, as confirmed by a separate experiment in which 

25 l3 C-3-propionate was used in lieu of 13 C-1 -propionate. From the intensities of peaks 

corresponding to 6-dEB, it is estimated that at least 75% of the exogenous propionate was 
converted into 6-dEB. This was consistent with the disappearance of the propionate signal 
from the I3 C NMR spectrum of the culture medium at the end of the fermentation. Negative 
control strains, which lacked either pBP130 or pBP144, failed to yield detectable quantities 

30 of 6-dEB. 

(0159) The foregoing experiments were performed at low cell densities (OD600 in the 
range of 0.5-2.5); a major advantage of synthesizing recombinant products in E. coli is that 
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this bacterium can be grown to extremely high cell densities (ODeoo of 100-200) without 
significant loss in its specific catalytic activity. 

[0160] The use of the matB and C genes or any of their homologs from other organisms 
in a non-native expression system is useful as a general strategy for the in vivo production 
5 of any alpha-carboxylated Co A thioester in any microbial host. The in vivo production of 
such CoA thioesters could be intended to enhance natural polyketide productivity or to 
produce novel polyketides. The mat A gene is also useful to supplement in vivo levels of 
substrates such as acetyl-CoA and propionyl-CoA. Purified MatB is also used for the 
preparative in vitro production of polyketides, since CoA thioesters are the most expensive 
1 0 components in such cell-free synthesis systems. 

Example 5 
Incorporation of Diketides 

[0161] The BAP1 E. coli host organism described in Example 4 was transfected with 
pi 32 which contains an expression system for the PCCA and B subunits and with pRSG36 

15 which contains an expression system for module 6+TE of DEBS3. The transfected cultures 
were grown on minimal selection media for both plasmids and then fed ,4 C labeled 
diketide. When induced and provided with propionate, I4 C labeled triketide was obtained. 

[0162] Alternatively, to co-express all three DEBS genes and the pec genes, vectors 
pET21c and pET28a (Novagen) were modified to express two and three genes, respectively 

20 (The construction of plasmids pBP130 and pBP144 is described in Example 4.) When 
tested individually, protein production was observed from each gene located on both 
plasmids. BAP1 was transformed with these plasmids together with the birA plasmid. 
Individual transformants were cultured, induced and analyzed similar to the experiment for 
DEBS1+TE (above) using [l- 13 C]-propionate. NMR analysis of the crude organic extract 

25 revealed 6-dEB as the major propionate-derived metabolite of these recombinant cells. The 
product was later purified by HPLC and subjected to mass spectrometry yielding a major 
peak of the expected mass. Plasmids pBP130, pBP144, and pCY216 were transformed into 
BAP1 as previously described. Culture conditions were identical to those described for 
i3 C-l -propionate fed at 250 mg/L described above in Example 2. Cultures were sampled 

30 regularly over 3 days. Samples were centrifuged and the supernatant (either 2 or 20 jiL) 
loaded onto a Hewlett-Packard 1090 HPLC using an initial 4.6 x 10 mm column (Inertsil, 
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C18 ODS3, 5 ^m), washed with water (1 ml/min for 2 min), and then loaded onto a main 
4.6x50 mm column with the same stationary phase and flow rate. A 6-minute gradient was 
then applied starting with 100% water and finishing with 100% acetonitrile maintained for 
an additional 1 minute. The samples were analyzed with an Alltech evaporative light 
5 scattering detection system (ELSD500), and a peak at 6.4 min retention time was 
confirmed as hepta- 13 C-labeled 6dEB by mass spectrometry (MWobs - 393). Product 
concentrations were measured in comparison to standard 6-dEB samples using the same 
detection scheme. At the end of the incubation period, the entire culture supernatant was 
extracted as before with ethyl acetate, dried, and analyzed by 13 C- NMR. Additionally, the 

10 final cell pellet was analyzed via SDS-PAGE to confirm the presence of the three DEBS 
proteins and the PCC. No differences were observed between the expression levels of the 
proteins at 12 h and 48 h post-induction. The stability of each plasmid in 
BAPl/pBP130/pBP144/pCY216 was also tested at 12 h and 36 h post-induction. No loss 
of pBP144 was observed at either time-point, whereas pBP130 and pCY216 were 

15 maintained in 50% and 35% of the colonies at 12 h and 36 h, respectively. No 

rearrangement of any plasmid was detected at either time-point, based on restriction 
analysis of multiple re-transformed colonies. Negative controls for the l3 C-NMR 
experiments included BAPl/pBP130/pCY216, BAPl/pBP144/pCY216, and 
BAPl/pBPl 30/pBP160/pCY216. (Plasmid pBP160 carries a C -> A null mutation at the 

20 active site of the KS domain in module 1 (Kao, C, et al, Biochemistry (1996) 35:12363). 
To quantify the productivity of this novel polyketide cellular system, culture samples were 
taken periodically, and the concentration of 6-dEB was measured (Figure 9). From this 
data it can be calculated that the specific productivity of this cellular catalyst is 0.1 mmol 
6dEB/g cellular protein/day. This is significantly superior to wild-type S. erythraea and 

25 compares well to an industrially relevant strain that overproduces erythromycin (0.2 mmol 
erythromycin/g cellular protein/day) (Minas, W., et al, Biotechnol Prog. (1998) 14:561) as 
a result of a decades-long program of directed strain improvement based on random 
mutagenesis. 



-44- 



WO 02/068613 



PCT/US02/06399 



Example 6 

Construction. Expression, and Purification of the A-T Loading Didomain 

[0163] The A-T loading didomain is naturally present at the N-terminus of RifA. To 
investigate this didomain biochemically, it was removed from the RifA protein context. 
5 Therefore, the sequence encoding the isolated A-T didomain was subcloned into an 

expression vector, using an Ndel restriction site engineered at the transcriptional start site 
of RifA and a NotI restriction site introduced in the linker region between the C-terminal 
end of the consensus T domain and the N-terminal end of the consensus ketosyhthase 
domain of module 1, as described below in more detail. Thiolation domains require 

10 covalent attachment of the 4'-phosphopantetheine moiety of CoA to a conserved serine to 
be active (Walsh, C. T., et at. 9 Curr. Opin. Chem. Biol. (1997) 1:309-315). The Sfp 
phosphopantetheinyl transferase from B. subtilis, which is capable of converting the apo 
forms of many heterologous recombinant proteins into the holo forms, was therefore 
co-expressed with the A-T didomain in the holo enzyme preparation (Lambalot, R. H., et 

15 al. 9 Chem. Biol (1996) 3:923-936; Quadri, L. E. N., et al, Biochemistry (1998) 

37:1585-1595). The apo and holo forms of the A-T didomain were produced in E. coli as 
C-terminal hexahistidine-tagged fusion proteins and were purified by nickel affinity 
chromatography to >98% homogeneity, as describe more fully below. Purified 
recombinant apo and holo A-T didomain (encoded by plasmid pS A8) were overproduced 

20 in E. coli y and protein samples were resolved by sodium dodecyl sulfate polyacrylamide gel 
electrophoresis (SDS-PAGE) (4-15%, Bio-Rad) and stained with SimplyBlue Safestain 
(Invitrogen). The apo A-T didomain and the holo A-T didomain each had a molecular 
weight of less than the 75 kD molecular weight marker. The holo A-T didomain had a 
slightly higher molecular weight that the apo A-T didomain. 

25 [0164] Materials. [7- I4 C]-Benzoic acid (57 mCi/mmol) and [7- 14 C]-3-hydroxybenzoic 
acid (55 mCi/mmol) were obtained from American Radiolabeled Chemicals. All other 
substituted benzoic acids, phenylacetic acid, and 3-hydroxyphenylacetic acid were obtained 
from Aldrich in unlabeled form. ATP, CoA, and benzoyl-CoA were supplied by Sigma 
Chemical Company. AHB was synthesized according to a previously published protocol 

30 (Ghisalba, O., et al, J. Antibiot. (1981) 34:64-71). Restriction enzymes were from New 
England Biolabs. 
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[0165] Manipulation of DNA and Strains. DNA manipulations were performed in E. 
coli XL1 Blue (Stratagene) using standard culture conditions. Sambrook, J., et al, 
Molecular Cloning: A Laboratory Manual^ 2nd ed., (1989) Cold Spring Harbor Laboratory 
Press, Plainview, NY. Polymerase chain reactions were carried out using Pfu polymerase 
5 (Stratagene) as recommended by the manufacturer. 

[0166] Construction oj 'an Expression Vector for the A-T 'Didomain. AnNdel 
restriction site was engineered at the start codon of the rifA gene using the primers 
5'-GCGGC CATATGC GCACCGATCTC-3' and 5-AGGGCCCGCTGGCGGGAGAAC-3' 
(mutated bases are shown in bold, and the introduced Ndel restriction site is underlined); 

10 the amplified 2.5 kb fragment was ligated to linearized pCR-Script (Stratagene) to produce 
pHu29. The rifA gene with the engineered Ndel restriction site at the start codon was then 
reconstructed in pHu90-l, a derivative of pRM5 (McDaniel, R., et al. (1993) Science 262, 
1 546-1550), via pHu29, pHu35, pHu50, and pHu5 1 . Flanking restriction sites for Pad and 
PstI were used to transfer the sequence encoding the loading didomain and part of module 

15 1 from pHu90-l into a pUC18 derivative to produce pSA2. The loading didomain and 
module 1 are separated by an -20 amino acid linker region, delineated by the C-terminal 
end of the consensus T domain of the loading didomain and the N-terminal end of the 
consensus ketosynthase domain of module 1 (GenBank accession no. AF040570). To 
isolate the loading didomain from module 1, a NotI restriction site was introduced into the 

20 linker sequence using the primers S'-ACCGAGACCTGCGGGGCGATCAO 1 and 
5'- GCGGCCGC GACGGCCTGCGTG-3' (mutated bases are shown in bold, and the 
introduced NotI restriction site is underlined); the resulting 0.94 kb fragment encodes from 
within the loading didomain into the linker region. This amplified fragment was ligated to 
linearized pCR-Blunt (Invitrogen) to produce pSA4, which was then digested with BamHI 

25 and PstI and ligated to pS A2 digested with the same enzymes to generate pS A6. The 1 .9 
kb Ndel-NotI fragment derived from pSA6 was ligated to Ndel-Notl-digested pET21c 
(Novagen) to produce pSA8, an expression vector for the loading didomain with 
hexahistidine appended to its C-terminus. 

[0167] Expression and Purification of the A-T Didomain. Plasmid pSA8 was 

30 introduced via transformation into E. coli BL21 (Stratagene) for expression of the apo A-T 

didomain. One liter cultures of BL21/pSA8 were grown at 37 °C in 2 L flasks containing 

LB medium supplemented with 100 ng/mL carbenicillin. Expression of the A-T didomain 

was induced with 100 \xM IPTG at an optical density at 600 nm of 0.7. After induction, 
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incubation was continued for 6 h at 30 °C. The cells were then harvested by centrifugation 
at 2500 x g and resuspended in disruption buffer [200 mM sodium phosphate (pH 7.2), 200 
mM sodium chloride, 2.5 mM DTT, 2.5 mM EDTA, 1.5 mM benzamidine, pepstatin 
(2 mg/L), leupeptin (2 mg/L), and 30% v/v glycerol]. 
5 [0168] All purification procedures were performed at 4°C. The resuspended cells were 
disrupted by two passages through a French press at 13,000 psi, and the lysate was 
collected by centrifugation at 40,000 x g. Nucleic acids were precipitated with 
polyethylenimine (0.15%) and removed via centrifugation. The supernatant was made 45% 
(w/v) saturated with ammonium sulfate and precipitated overnight. After centrifugation, 

10 the pellet containing protein was redissolved in 50 mM tris(hydroxymethyl)aminomethane 
hydrochloride (Tris-HCl) (pH 8), 300 mM sodium chloride, 10 mM imidazole, and 10% 
v/v glycerol. This solution was loaded onto a previously equilibrated nickel-nitrilotriacetic 
acid (Ni-NTA) column (2 mL, Qiagen). The column was washed with 20 mM imidazole in 
50 mM Tris-HCl (pH 8), 300 mM sodium chloride, and 10% v/v glycerol, and the A-T 

15 didomain was eluted with 100 mM imidazole in the same solution. Pooled fractions 

containing the A-T didomain were buffer exchanged into 100 mM sodium phosphate (pH 
7.2), 2.5 mM DTT, 2 mM EDTA, and 20% v/v glycerol by gel filtration (PD-10, 
Pharmacia) and concentrated with a Centriprep-50 concentrator (Amicon). The purified 
protein was flash-frozen in liquid nitrogen and stored at -80 °C. Protein concentration was 

20 determined using the calculated extinction coefficient at 280 nm: 49500 M'W 1 (Gill, S. 
C, et al. (1989) Anal. Biochem. 182, 319-326). A typical 1 L culture produced about 30 
mg of purified protein. 

[0169] For expression of the holo A-T didomain, plasmid pSA8 was transformed into 
BL21 containing the plasmid pRSG56 (Gokhale, R. S., et al. (1999) Science 284, 482-485), 

25 which carries a kanamycin resistance gene and the sfp gene. The sfp gene expresses Sfp, a 
non-specific phosphopantetheinyl transferase from B. subtilis that converts the apo protein 
into the holo protein (Lambalot, R. H., et al. (1996) Chem. Biol. 3, 923-936; Quadri, L. E. 
N., et al. (1998) Biochemistry 37, 1585-1595). One liter cultures of this recombinant E. 
coli strain were grown at 37 °C in 2 L flasks containing LB medium supplemented with 

30 100 ng/mL carbenicillin and 50 |ig/mL kanamycin. The expression and purification steps 
for the holo A-T didomain were performed as described above for the apo A-T didomain. 
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Example 7 

Radioactive Labeling of A-T Diddmain to Determine Mechanism of the A-T Didomain 

[0170] For qualitatively assessing the incorporation of B or 3-HB into the A-T 
didomain, reactions contained 5 jiM apo or holo A-T didomain, 50 mM sodium phosphate 
5 (pH7.2), 1 mM DTT, 1 mM EDTA, 15 mM MgCl 2 , 10% glycerol, and 100 \M [7- !4 C]-B 
or [7- ,4 C]-3-HB. In reactions where ATP was included, 5 mM was present. After 
incubation at 30°C for 30 min, reactions were quenched with SDS-PAGE sample buffer 
and electrophoresed on a 4-15% gradient gel (Bio-Rad). The gel was briefly stained with 
Coomassie blue, destined, dried, and autoradiographed. 

10 [0171] As depicted in Figure 2, both models for the mechanism of the A-T didomain 
involve activation of AHB as the aryl-adenylate by the A domain, followed by eventual 
formation of a covalent aryl thioester enzyme intermediate from attack of either aryl-CoA 
(Figure 2 A) or the aryl-adenylate (Figure 2B) by the thiol nucleophile of the 
phosphopantetheine cofactor of the T domain. To investigate these possible mechanisms, 

15 we sought to covalently load the A-T didomain. Although AHB is not available in 

radiolabeled form, in vivo feeding experiments have demonstrated that RifA can also be 
primed by 3-HB (Hunziker, D., etal.,J. Am. Chem. Soc. (1998) 120:1092-1093). 
Reactions containing [ l4 C]-3-HB or the putative substrate [ ,4 C]-benzoate (B) and apo or 
holo A-T didomain were incubated in the presence or absence of MgrATP and 

20 subsequently analyzed by SDS-PAGE autoradiography (Figure 3) as described in detail 
below. Lacking the phosphopantetheine cofactor, the apo A-T didomain could not be 
covalently loaded (lane 1). However, the holo A-T didomain is covalently loaded with 
both B and 3-HB in reactions that require Mg*ATP (lanes 2-5). 

[0172] CoA was not included in the labeling reactions described above, suggesting that 

25 it is not required for covalent loading of the holo A-T didomain. Since the loading 
didomain has been proposed to be a CoA ligase (Figure 2 A) (Schupp, T., et al y FEMS 
Microbiol Lett (1998) 159:201-207; August, P. R., et al, Chem. Biol. (1998) 5:69-79; 
Ghisalba, O., et al 9 J. Antibiot. (1981) 34:64-71), we nevertheless tested the possible- 
involvement of CoA directly. 

30 [0173] HPLC was used to detect the possible benzoyl-CoA formation according to the 
following procedure. Reactions contained 10 apo A-T didomain, 50 mM sodium 
phosphate (pH 7.2), 1 mM DTT, 1 mM EDTA, 15 mM MgCl 2 , 5 mM ATP, 10% glycerol, 
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1 mM Co A, and 1 mM B. In reactions where benzoyl-CoA was included, 100 \iM was 
present. After incubation at 30 °C for the indicated times, 20 samples were injected 
into an HPLC equipped with a CI 8 reverse phase column (VYDAC, 250 x 5 mm) with the 
detector monitoring at 254 nm. A linear gradient between buffer A (25 mM potassium 
phosphate, pH 5.4) and buffer B (100% acetonitrile) from 0% to 50% B was run over 14 
min with a flow rate of 1 mL/min. The substrate and putative product peaks were 
identified by co-injection with authentic standards. 

[0174] If the mechanism shown in Figure 2A is operative, the apo A-T didomain 
should be capable of producing benzoyl-CoA. However, no benzoyl-CoA formation could 
be detected when the apo A-T didomain was incubated with ATP, B, and CoA (Figure 4, 
-benzoyl-CoA traces). To confirm that benzoyl-CoA, if formed, would persist in these 
reaction conditions, benzoyl-CoA was added to an otherwise identical reaction (Figure 4, 
+benzoyl-CoA traces). Benzoyl-CoA is degraded with an observed rate constant of -0.002 
min* 1 , and this degradation is enzyme-independent since the same observed rate constant is 
obtained for reactions in which the apo A-T didomain is omitted (data not shown); this 
slow nonenzymatic degradation is taken into account in the A^ at analysis that follows. 

[0175] Accumulation of 5 nM benzoyl-CoA is readily detectable using this HPLC 
assay. This conservative detection limit allows an upper limit for k^x for the formation of 
benzoyl-CoA by the apo A-T didomain to be calculated, as follows. Accumulation of 5 
HM benzoyl-CoA would indicate that at most 10 \xM benzoyl-CoA was formed during the 
300 min reaction, as the half-life of benzoyl-CoA is -300 min under these conditions (ti/2 = 
ln2/£ob S ; tabs ~ 0.002 min '). Therefore, the velocity of benzoyl-CoA formation is at most 
0.03 |iM/min (10 ^iM/300 min). This corresponds to taat < 0.003 min* 1 , as the 
concentration of the apo A-T didomain in these reactions was 10 jiM (taat = v/[E] t ). As 
described below, taat for covalent loading of the holo A-T didomain with B is 0.14 min" 1 . 
Therefore, benzoyl-CoA is not a competent intermediate in the arylation reaction, as the 
rate constant for its formation is at least 50 fold less than the rate constant for formation of 
E-B. These results indicate that the CoA ligase model depicted in Figure 2A is not viable 
for the A-T loading didomain of rifamycin synthetase. 
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Example 8 

Direct Measurement of Kinetic Parameters for the Holo A-T Didomain 

[0176] Typical reactions contained 1-10 nM holo A-T didomain, 50 mM sodium 
phosphate (pH 7.2), 1 mM DTT, 1 mM EDTA, 5 mM ATP, 15 mM MgCl 2 , 10% glycerol, 
5 0.5-5 nCi/mL [7- 14 C]-B or [7- I4 C]-3-HB, and varying concentrations of unlabeled B or 
3-HB . Unlabeled B and 3-HB stocks were adjusted to the reaction pH prior to addition. 
Reactions were incubated at 30 °C, and at desired time points 20 |iL aliquots were 
quenched in 1 mL of ice-cold 5% trichloroacetic acid and 200 jig of bovine serum albumin 
(Sigma) was added to this mixture to aid precipitation of the protein. The precipitate was 

10 pelleted by centrifiigation, washed with 0.5 mL of 5% trichloroacetic acid and solubilized 
in 0.5 mL of a 100 mM phosphate (pH 8), 2% SDS solution. This solution was combined 
with 4.5 mL of liquid scintillation fluid (Formula 989, Packard), and the incorporated 14 C 
label, corresponding to E-B or E-3-HB, was quantified by liquid scintillation counting. 
Reaction rates were linearly dependent on enzyme concentration. Data analysis was 

1 5 performed using Kaleidagraph (Synergy Software), and exponential fits to the data 
typically gave R> 0.99. 

[0177] B and 3-HB are substrates for the holo A-T didomain, as shown qualitatively in 
Figure 3. To quantitatively assess these benzoates as substrates for aryl-adenylate 
formation followed by arylation of the thiol of the phosphopantetheine cofactor of the T 

20 domain, we utilized the protein precipitation assay described above. As discussed above, 
aliquots from reactions containing holo A-T didomain, 0.5-5 nCi/mL [7- ,4 C]-B or 
[7- l4 C]-3-HB, and varying concentrations of unlabeled B or 3-HB were quenched with 
trichloroacetic acid, and the amount of radiolabeled protein in each washed protein pellet 
was determined by liquid scintillation counting. Initial velocities of E-B or E-3-HB 

25 formation as a function of B or 3-HB concentrations were obtained using this method and 
used to generate the saturation curves shown in Figure 5. Best fits of the data to a 
saturation model give a kc at of 1 .9 min' ! and K M of 1 80 jiM for 3-HB, and a ^at of 
0.14 min" 1 and Km of 170 \xM for B. The ratio of kc^/Ku values for the two substrates 
reveals a 12 fold preference for 3-HB over B by the A-T didomain. Addition of CoA to 

30 these reactions had no effect (data not shown), consistent with the conclusion that the A-T 
didomain is not a CoA ligase. 
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Example 9 

Chase Experiment to Screen for Substrate Specificity of the A-T Didomain 

[0178] Reactions were carried out in 50 mM sodium phosphate (pH 7.2), 1 mM DTT, 
1 mM EDTA, 5 mM ATP, 15 mM MgCl 2 , and 10% glycerol. Each reaction additionally 
5 contained 20 \xM holo A-T didomain and 0.5 mM of a putative substrate, 0.5 mM 

unlabeled B, or no added substrate. After incubation for 30 min at 30 °C, 100 |iL reaction 
aliquots were applied to individual G-25 microspin gel filtration columns (Pharmacia) that 
had been pre-equilibrated with the reaction buffer. The protein component of the applied 
sample was eluted from the microspin column in constant volume by centrifugation, 
10 according to the manufacturers instructions. A 10 jiL aliquot of each eluted protein sample 
was diluted with 2 jiL of a [7- I4 C]-B solution, for a final B concentration of 200 ^M. 
These chase reactions were incubated for 15 min at 30°C prior to analysis by SDS-PAGE 
autoradiography. 

[0179] Based on previous in vivo feeding experiments (Hunziker, D., et al, 7. Am. 

15 Chem. Soc. (1998) 120:1092-1093) and the in vitro results just described, AHB, 3-HB, B, 
and 3,5-dihydroxybenzoate are accepted as substrates by the A-T didomain. 

[0180] To screen for additional substrates that can prime the A-T didomain, the simple 
chase experiment was devised as described above. Holo A-T didomain was first incubated 
with a putative substrate under standard reaction conditions. The reaction mixture was then 

20 passed over a microspin gel filtration column to separate the protein components from the 
putative unreacted substrate. Radiolabeled B was finally added to the protein fraction, and 
the mixture was incubated briefly prior to SDS-PAGE autoradiography. Protein samples 
that had originally been incubated with a substrate would contain covalently loaded 
enzyme-substituted benzoate (E-XB), which would not react with radiolabeled B during the 

25 chase, resulting in little or no detectable enzyme-benzoate (E-B) by SDS-PAGE 

autoradiography. In contrast, protein samples that had originally been incubated with a 
poor substrate or a non-substrate would primarily contain free enzyme (E), which would 
readily react with radiolabeled benzoate (B) during the chase to form E-B, resulting in a 
radioactive band detectable by SDS-PAGE autoradiography. 

30 [0181] The results of this screening experiment for a series of substituted benzoates are 
discussed below. An autoradiograph of a gel (4-15%, Bio-Rad) containing A-T didomain 
samples chased with radiolabeled B after incubation with no substrate; unlabeled B; 
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2- aminobenzoate; 3-aminobenzoate; 4-aminobenzoate; AHB; 3-amino-4-hydroxybenzoate; 
4-amino-2-hydroxybenzoate; 3-bromobenzoate; 3-chlorobenzoate; 3,5-diaminobenzoate; 
3,5-dibromobenzoate; 3,5-dichlorobenzoate; 3,5-difluorobenzoate; 2,3-dihydroxybenzoate; 
3,5-dihydroxybenzoate; 3,5-dinitrobenzoate; 3-fluorobenzoate; 2-hydroxybenzoate; 3-HB; 

5 4-hydroxybenzoate; 3-methoxybenzoate; 3-nitrobenzoate; 3-sulfobenzoate. 

[0182] The first two lanes contain control reactions in which no substrate (lane 1) or 
unlabeled B (lane 2) was present in the initial incubation; as expected, radiolabeled A-T 
didomain was formed in the no substrate control reaction but not in the unlabeled B control 
reaction. Radiolabeled A-T didomain is likewise absent from reactions in which the known 

1 0 substrates AHB (lane 6), 3,5-dihydroxybenzoate (lane 1 6), and 3-HB (lane 20) were 
present in the initial incubation. In addition to these three substrates, ten more likely 
substrates were identified for further investigation based on the absence or diminution of 
radiolabeled A-T didomain as compared to the lane 1 control reaction. These ten substrates 
are 2-aminobenzoate; 3-aminobenzoate; 3-bromobenzoate; 3-chlorobenzoate; 

1 5 3,5-diaminobenzoate; 3,5-dibromobenzoate; 3,5-dichlorobenzoate; 3,5-difluorobenzoate; 

3- fluorobenzoate; and 3-methoxybenzoate. Although the simplest model for the absence of 
radiolabeled A-T didomain in a given reaction is that the substituted benzoate in question 
has been loaded onto the A-T didomain, blocking the enzyme from reaction with 
radiolabeled B during the chase, this experiment does not rule out the possibility that it is 

20 instead a tight binding competitive inhibitor. However, the observation described below 
that the competition between these substituted benzoates and the substrate B is time- 
independent renders the inhibition model unlikely. Radiolabeled A-T didomain was 
formed in the reactions having the following substrates: 4-aminobenzoate; 3-amino-4- 
hydroxybenzoate; 4-amino-2-hydroxybenzoate; 2,3-dihydroxybenzoate; 

25 3,5-dinitrobenzoate; 2-hydroxybenzoate; 4-hydroxybenzoate; 3-nitrobenzoate; and 
3-sulfobenzoate. 

Example 10 

Relative Specificity Determination Using Relative Rate 
Constants for Arvlation of the A-T Didomain 

30 [0183] Armed with the set of likely substrates found in the screening described in the 
chase experiment (Example 9), the relative specificity of the A-T didomain for aryl- 
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adenylate formation followed by arylation of the thiol of the phosphopantetheine cofactor 
of the T domain was determined. Addition of a substituted benzoate to a reaction mixture 
containing radiolabeled benzoate (B) and the holo A-T didomain allowed partitioning 
between reaction with the substituted benzoate (XB) and reaction with B to be followed. 
5 Reactions were performed as described above (Example 8 Kinetic Measurements) but in 
the presence of 50 jiM-5 mM of a series of substituted benzoates. Substituted benzoate 
stocks were adjusted to the reaction pH prior to addition. The rate constant relative to an 
analogous reaction with benzoate (A^ e i) for reaction of a given substituted benzoate with 
respect to reaction of B was determined from the concentrations of B and substituted 
1 0 benzoate in the original reaction ([B], [XB]) and the amount of product present as E-B and 
E-XB, according to the equation in Scheme 1 below. (Fersht, A. R. (1998) in Structure 
and Mechanism in Protein Science pp. 116-11 7, W. H. Freeman, New York.) 



Scheme 1 




• E-XB+AMP + PPi 

. _ J<xb_ _ [E-XBIB] 
rel k B [XB][E-B] 

(B = benzoate, XB = substituted benzoate) 

[0184] The amount of E-XB product in each reaction at a given time point was 
15 determined by subtracting the amount of radiolabeled E-B in the presence of the competing 
substituted benzoate from that obtained at the same time point in an identical reaction 
lacking competitor. The ratio of E-B to E-XB was constant throughout a particular time 
course, indicating that no secondary reactions involving the reaction products were 
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occurring. The constant ratios also support the view that the substituted benzoates are true 
substrates and not high affinity competitive inhibitors, as E-B would continue to 
accumulate in the presence of a competitive inhibitor, resulting in a ratio of E-B to 
apparent E-XB that increases as a function of time. For each substituted benzoate, the 
same Jfe^ value, within error, was obtained for reactions performed at different substituted 
benzoate concentrations. The reactions were repeated for selected substituted benzoates 
using radiolabeled 3-HB instead of B, and the same values (with respect to B), within 
error, were obtained. Each k xt \ value in Table 1 represents an average of at least 4 separate 
determinations. Competition with B for reaction with the A-T didomain by phenylacetate 
and 3-hydroxyphenylacetate could not be detected, so limits for for these compounds 
are reported in Table 1. 

[0185] The fa values in Table 1 represent the A^/Km ratio for a given substituted 
benzoate and B, and as such provide a measure of the specificity of the A-T didomain for 
each substrate (Fersht, A. R. (1998) in Structure and Mechanism in Protein Science pp. 
1 1 6-1 17, W. H. Freeman, New York). The validity of this approach is demonstrated by 
comparing the k xt \ value of 12 obtained for 3-HB with the identical KdYLu ratio of 12 
obtained from direct measurement of /c^/Km for 3-HB and B (Figure 4). The A-T 
didomain exhibits a 10-1000 fold preference for AHB, its biological substrate, over all 
other substrates. 



[0186] To engineer a functional fusion between the A-T loading didomain from the 
rifamycin synthetase and the first module of DEBS, the DNA sequence immediately 
upstream of the KS domain in DEBS module 1 was modified to read as follows: 
CCGGCGAACCGATCGCGATCGTCGCGATGG. The engineered BsaBI site (in bold) 
was fused to the corresponding naturally occurring BsaBI site between the A-T loading 
didomain and the first PKS module of the rifamycin synthetase (Figure 6). The resulting 
fusion was transferred into pBP144 in place of DEBS1, giving rise to pBP165. 



Example 1 1 
Construction of Plasmid pBP165 
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Table 1. 

Relative Rate Constants for Covalent Loading of the A-T Didomain by Substituted 

Benzoates a 
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benzoate 


0) 


2-aminobenzoate 


0.62 ± 0.08 


3-methoxybenzoate 


0.43 ± 0.06 


3-fluorobenzoate 


0.42 ±0.11 


3,5-difluorobenzoate 


0.13 ±0.02 


phenylacetate 


<0.01 


3-hydroxyphenylacetate 


<0.01 



a 30 °C, 50 mM sodium phosphate, pH 7.2, 1 mM DTT, 1 mM EDTA, 15 
5 mM MgCl 2 , 5 mM ATP, 10% glycerol. 

b Rate constant for T domain arylation, relative to T domain arylation by 
benzoate (Scheme 1). 

Example 12 

Analytical Methods 

1 0 (0187] Optical density measurements were made at 600 nm using a Beckman DU650 
spectrophotometer with necessary dilutions made with PBS. Glucose concentrations were 
measured using an enzymatic hexokinase detection kit (Sigma-Aldrich). Acetate and 
propionate concentrations were analyzed using an isocratic HPLC method with 5 mM 
H 2 S0 4 (Agilent 1 100 HPLC series). The column used was a Bio Rad HPLC organic acid 
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analysis column (Arninex HPX-87H) maintained at 55°C with refractive index used to 
detect acetate and propionate from clarified fermentation broth samples (20 \iL injections). 
A separate HPLC assay was used to detect 6dEB as described previously (Lombo, F., et al., 
Biotechnol Prog. (2001) 17(4):612-617). Protein levels were monitored after cell 
5 disruption (sonication using 2 ml samples), clarification, and detection via 
SDS-PAGE/SimplyBlue™ SafeStain (Invitrogen, Carlsbad, CA) staining. 

Preliminary Optimization 

[0188] E. coli BAP1 was used as the host cell (F-ompThsdS B (r B - m B -) gal dcm (DE3) 
AprpRBCD (sjp)). 6dEB was produced by BAPl/pBP130(Carb R )/pBP144(Kan R ). 

10 Pfeiffer, B.A., et al, Science (2001) 291 :1 790-1792, incorporated herein by reference. 

This strain has the necessary PKS and auxiliary genes for the synthesis of 6dEB. Two are 
integrated into the genome, five are on two plasmids. 

[0189] Cell stocks were prepared by growing a culture of BAPl/pBP130/pBP144 in 
Luria Bertani medium (LB) supplemented with carbenicillin (100 mg/L) and kanamycin 

1 5 (50 mg/L) at 37°C and 250 rpm. After the culture reached an QD m between 0.5 and 1 , the 
cells were centrifuged and resuspended in half the initial culture volume using Fl medium 
with 8% glycerol. Fl medium contains: KH 2 P0 4 , 3 g/L; K 2 HP0 4 , 6.62 g/L; (NH 4 ) 2 S0 4 , 
4 g/L; MgS0 4 , 105.5 mg/L; Glucose, 5 g/L; Trace Metal Solution, 1.25 ml/L; and 
Vitamin Solution, 1.25 ml/L. (The trace metals solution contains: FeCl3 # 6H 2 0, 27 g/L; 

20 ZnCl 2 *4H 2 0, 2 g/L; CaCl 2 *6H 2 0, 2 g/L; Na 2 Mo0 4 *2H 2 0, 2 g/L; CuS0 4 *5H 2 0, 1.9 g/L; 
H3BO3, 0.5 g/L; cone. HC1, 100 ml/L; the vitamin solution contains: riboflavin, 0.42 g/L; 
pantothenic acid, 5.4 g/L; niacin, 6 g/L; pyridoxine, 1.4 g/L; biotin, 0.06 g/L; folic 
acid, 0.04 g/L.) Resuspended cells were aliquotted and frozen at -80°C. 

[0190] For small-scale experiments, 25 mL Fl medium in 250 mL flasks was 

25 inoculated with 0.5 mL of a pre-frozen glycerol cell stock, and grown at 37°C and 250 rpm 
(with carbenicillin and kanamycin concentrations at 100 and 50 mg/L, respectively) to a 
suitable OD^o (between 0.2 to 1, as specified below). At this point the culture was cooled 
to 22°C, induced with IPTG (isopropylthio-P-D-galactoside, GibcoBRL, Grand Island, NY, 
between 10 \xM and 10 mM, as specified below), and supplemented with sodium 

30 propionate (Sigma- Aldrich, St. Louis, MO, to a final concentration between 1 0 mg/L and 

10 g/L, as specified below). The cultures were then incubated at 22 or 25°C at 200 rpm 

until no further increase in production was observed. 

-56- 



WO 02/068613 PCT/US02/06399 



[0191] The effects of various growth and process parameters on the productivity and 
titer of 6dEB in cultures of E. coli BAPl/pBP130/pBP144 were studied under these 
conditions. Initially, three media that are compatible with fed-batch cultivation of E. coli 
were tested (data not shown). Of these, fermentation medium Fl gave the best yields 
5 (-1 mg/L over 19 hrs). The effect of IPTG and propionate concentrations, post-induction 
temperature, and time of induction was investigated using this medium. The optimal 
conditions found based on these experiments are as shown on Table 2. 



Table 2 

Variables Tested Under Shake-Flask Cultivation Conditions 



Variable 


Range Tested 


Optimal Level 


OD600 at Induction 


0.25-4.5 


Between 0.25 and 1.0 


IPTG 


10nM-10mM 


100 nM 


Propionate 


lOmg/L-lOg/L 


Between 50 mg/L and 2 g/L 


Post-Induction Temperature 


22°Cand 25°C 


No effect 



10 

For the best conditions, 6dEB titers were consistently found to be between 1-3 mg/L over 
19 hrs. 

Effect of Nutrient Stream, Induction Time and Propionate Feeding 

[0192] Fed-batch aerated fermentations were conducted using an Applikon 3L 
1 5 Biobundle system (Applikon Inc., Foster City, CA). A starter culture was grown in 

1.5 mL LB medium (at 100 mg/L carbenicillin and 50 mg/L kanamycin). After reaching 
late exponential phase at 37°C and 250 rpm, the culture was centrifuged and resuspended in 
50 mL LB (at 100 mg/L carbenicillin and 50 mg/L kanamycin). The culture grew 
overnight at 30°C and 200 rpm to stationary phase, was centrifuged, and resuspended in 
20 20 mL phosphate buffered saline (PBS) for inoculation into the 3L vessel containing 2L Fl 
medium which contained KH 2 P0 4 , 1.5 g/L; K 2 HP0 4 , 4.34 g/L; (NHLO2SO4, 0.4 g/L; 
MgS0 4 , 150.5 mg/L; Glucose, 5 g/L; Trace Metal Solution, 1.25 ml/L; and 
Vitamin Solution, 1.25 ml/L. Growth was conducted at 37°C with pH maintained 
throughout the experiment at 7.1 with 1M H2SO4 and concentrated NH4OH. Aeration was 
25 maintained at 2.8 L/min with agitation controlled at 600-900 rpm to maintain dissolved 
oxygen over 50% air saturation. The fermentation apparatus including salt solution 
(KH2PO4, K2HPO4, and (NH 4 )2S0 4 ) was autoclaved, whereas the additional feed 
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components (MgS04, glucose, trace metals, and vitamins) were filter-sterilized and added 
aseptically prior to inoculation along with carbenicillin at 150 mg/L and kanamycin at 
75 mg/L. The feed was also filter sterilized. Once the glucose was exhausted from the 
starting medium (as indicated by a sudden decrease in the oxygen requirement of the 
5 culture), the temperature was reduced to 22°C, and IPTG (100 \iM) and sodium propionate 
(2 g/L) were added. At that point a peristaltic pump started to deliver 0. 1 ml/mm of the 
Feed Medium which is: (NH 4 )2S0 4 , 1 10 g/L; MgS0 4 , 3.9 g/L; Glucose, 430 g/L; Trace 
Metal Solution, 10 ml/L; and Vitamin Solution, 10 ml/L, and samples were typically taken 
twice daily thereafter. Additional sodium propionate was added approximately every 48 

10 hrs (to 2 g/L, assuming no propionate remained) to avoid depletion of this precursor for 
polyketide biosynthesis. 

[0193] As shown in Figure 10A, cell growth was extended to a final ODeoo between 50 
and 70 (24 and 33 g dry cell weight/L) by the controlled addition of a nutrient feed stream 
that maintained glucose and acetate concentrations below 1 g/L throughout the course of 

15 the experiment. Polyketide biosynthesis was induced at an OD^o between 5 and 10 by 

addition of 100 nM IPTG and 2 g/L propionate at 22°C. Intracellular protein was analyzed 
periodically thereafter via SDS-PAGE, and the expressed levels of soluble PKS proteins 
were found to remain relatively constant throughout the length of the experiment 
(beginning approximately four hours after induction). This fermentation procedure yielded 

20 a final 6dEB titer of 70 mg/L over a 1 1 0 hour period. Precursor and product analysis 
showed that the titer reached a plateau after all the exogenous propionate had been 
consumed. As shown in Figure 10B, propionate supplementation in the fermentation 
medium reproducibly resulted in a titer increase to over 100 mg/L 6dEB. The maximum 
conversion of propionate into 6dEB was 6%. Fermenter and shake flask specific 

25 productivity was similar; however, the fermenter volumetric productivity was 17 fold 
higher than shake flask values. 

Effect of TEII 

[0194] The eryH gene from the erythromycin biosynthetic gene cluster in 
Saccharopolyspora erythraea encodes a homolog of thioesterase genes (also designated 
30 thioesterase II or TEH) Weber, J. M., et al f J. BacterioL (1990) 172:2372-2383. TEII has 
been shown to enhance polyketide biosynthesis in actinomyces hosts by an as yet unknown 
mechanism (Hu, Z., unpublished result). The eryH gene was cloned under control of an 
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IPTG-inducible T7 promoter on a chloramphenicol resistant plasmid, pGZl 19EH 
(Lessl, M., et al, J. BacterioL (1992) 174:2493-2500). This plasmid is compatible with 
plasmids pBP130 and pBP144. The resulting plasmid, pBP190, was co-transformed into 
BAP1 along with pBP130 and pBP144, and studied using the fermentation protocol 
5 outlined above (including chloramphenicol at 34 mg/L in the starter cultures and 20 mg/L 
in the fermenter). 

(0195] The growth characteristics of this recombinant strain mirrored those of previous 
experiments, but the titer of 6dEB doubled to -180 mg/L (Figure 1 1). These experiments 
suggest that TEH co-expression enhances the polyketide productivity of E. coli analogous 

10 to its effect in Streptomyces spp. (where 6dEB production also doubled compared to 
cultures without TEII [Hu, Z., unpublished result]). 

[0196] TEII coexpression described above did not significantly improve the plasmid 
stability properties of BAP 1 and the fermentation gene expression profile showed no 
qualitative difference when compared to fermentations without TEII expression. Unlike 

1 5 the DEBS and PCC proteins, TEH was not readily visible via SDS-PAGE analysis, but 
when expressed in isolation using shake- flask conditions meant to mirror fermenter 
conditions, TEH (-30 kD) could be readily purified using an N-terminal 6xhistidine tag. 
This suggests that, once expressed, TEII is present within the cellular background of 
E. coli. Small scale in vivo radioactive experiments Pfeiffer (2001) supra, also support the 

20 notion that titer increase was due to TEII coexpression. As noted above, E. coli, BAP1, 
pBP130, pBP144, and pBP190 was compared to BAPl/pBP130/pBP144/pGZl 19EH. 
Additionally, these two strains were compared to BAPl/pBP130/pBP144. Each 
comparison showed an -2 fold increase in 6dEB titer with TEII coexpression. 
[0197] In earlier work we have confirmed that E. coli BAP1 is unable to utilize 

25 propionate as a sole carbon source to support growth, as the only known pathway for 
. propionate catabolism has been deleted from this recombinant strain. However, since 
<10% of exogenous propionate is converted into 6dEB, it appears that propionate is 
catabolized into unknown and potentially undesirable side products. Since the ability of 
E. coli to activate propionate as propionyl-CoA has been retained (and intentionally 

30 amplified) in BAP1, it is possible that biosynthesis of odd chain fatty acid synthesis 

provides one route for the utilization of exogenous propionate. When radio labeled 

propionate is fed to E. coli BAPl/pBP130/pBP144, radio-labeled products which are even 

more lipophilic than 6dEB (as judged by radio-TLC analysis; data not shown) are observed. 
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Another possibility is the potential utilization of propionate via non-specific transacylation 
mechanisms that can utilize propionyl-CoA in lieu of acetyl-CoA (the most common acyl 
donor in vivo). The identification of such "dead-end" products could lead to the genetic or 
metabolic attenuation or elimination of such non-productive pathways from the E. coli host, 
5 thereby further enhancing its capacity for polyketide biosynthesis. 

[0198] Our results have demonstrated the utility of an accessory thioesterase present in 
Saccharopolyspora erythraea, the TEH, for enhancing the productivity of 6dEB in E. coli. 
The exact mechanism of the TEII is presently not clear, and it is believed that this enzyme 
plays an editing role by hydrolyzing incorrectly processed intermediates off the 

10 multifunctional PKS. Alternatively, the TEII may increase the intracellular activity of PKS 
enzymes by purging acyl carrier protein (ACP) domains that have been post-translationally 
modified with an inappropriate phosphopantetheine donor. Ordinarily ACP domains on 
PKSs are post-translationally modified by attachment of the phosphopantetheine arm at an 
active site serine. This reaction is catalyzed by a phosphopantetheinyl transferase. In 

1 5 BAP1 , a heterologous enzyme, Sfp, from Bacillus subtilis catalyzes this reaction. The 

primary advantage of using Sfp is that it possesses a broad substrate specificity for virtually 
any acyl carrier protein domain; however, since Sfp can utilize acyl-CoA donors with 
comparable specificity to CoASH, it may erroneously utilize substrates such as acetyl-CoA 
or propionyl-CoA. If so, the acylated phosphopantetheine would effectively block 

20 polyketide intermediates from being processed past this misprimed ACP domain until the 
thioester bond is hydrolyzed. To the extent the TE domain can selectively hydrolyze 
acetyl-ACP or propionyl-ACP domains in preference to the corresponding CoA thioesters, 
it may act as an enhancer of newly synthesized PKS activity in vivo. 
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Claims 

1 . Microbial host cells which are genetically modified for enhanced synthesis 
of at least one polyketide, wherein said modification comprises incorporation of at least one 
expression system for producing a protein that catalyzes the production of starter and/or 

5 extender units and/or disabling at least one endogenous pathway for catabolism of starter 
and/or extender units. 

2. A method to produce a polyketide which method comprises culturing the 
cells of claim 1 under conditions wherein said polyketide is produced. 

3. A method to assess the results of a procedure effecting modification of 
polyketide synthase genes, resulting in a mixture of said modified genes which method 
comprises 

transfecting a culture of cells of claim 1 with said mixture of modified genes, 
wherein said cells are E. coli> 

culturing individual colonies of said transformed E. coli> and 
assessing each colony for polyketide production 

4. A method to determine whether a substituted benzoate can prime an 
adenylation-thiolation (A-T) didomain of a rifamycin synthetase comprising 

incubating a substituted benzoate with a holo A-T didomain under conditions 
suitable for priming the A-T didomain; and 
20 measuring the amount or presence of the substituted benzoate that primed the A-T 

didomain. 

5. Procaryotic host cells which do not produce a polyketide in the absence of 
genetic modification and which are genetically modified for enhanced synthesis of at least 
one hybrid polyketide, wherein said modification comprises incorporation of at least one 

25 expression system comprising an A-T didomain, which incorporates a starter unit that 
primes an A-T didomain according to the method of claim 4. 

6. The procaryotic host cells defined in claim 5 wherein the starter unit is 

selected from the group consisting of 2-aminobenzoate, 3-aminobenzoate, 4- 
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aminobenzoate, 3-amino-5-hydroxybenzoate, 3-amino-4-hydroxybenzoate, 4-amino-2- 
hydroxybenzoate, 3-bromobenzoate, 3-chlorobenzoate, 3,5-diaminobenzoate, 3,5- 
dibromobenzoate, 3,5-dichlorobenzoate, 3,5-difluorobenzoate, 2,3-dihydroxybenzoate, 3,5 
dihydroxybenzoate, 3,5-dinitrobenzoate, 3-fluorobenzoate, 2-hydroxybenzoate, 3- 
5 hydroxybenzoate, 4-hydroxybenzoate, 3-methoxybenzoate, 3-nitrobenzoate, and 3- 
sulfobenzoate to make a modified polyketide. 

7. The procaryotic host cells defined in claim 5 wherein the starter unit is 
selected from the group consisting of 2-aminobenzoate, 3-aminobenzoate, 4- 
aminobenzoate, 3-amino-4-hydroxybenzoate, 4-amino-2-hydroxybenzoate, 3- 

10 bromobenzoate, 3-chlorobenzoate, 3,5-diaminobenzoate, 3,5-dibromobenzoate, 3,5- 
dichlorobenzoate, 3,5-difluorobenzoate, 2,3-dihydroxybenzoate, 3,5-dinitrobenzoate, 3- 
fluorobenzoate, 2-hydroxybenzoate, 4-hydroxybenzoate, 3-methoxybenzoate, 3- 
nitrobenzoate, and 3-sulfobenzoate to make a modified polyketide. 

8. A hybrid polyketide in which a starter unit is incorporated therein which 
1 5 starter unit primes an A-T didomain according to the method of claim 4. 

9. The hybrid polyketide defined in claim 8 where the starter unit is selected 
from the group consisting of 2-aminobenzoate, 3 -aminobenzoate, 4-aminobenzoate, 3- 
amino-5-hydroxybenzoate, 3-amino-4-hydroxybenzoate, 4-amino-2-hydroxybenzoate, 3- 
bromobenzoate, 3-chlorobenzoate, 3,5-diaminobenzoate, 3,5-dibromobenzoate, 3,5- 

20 dichlorobenzoate, 3,5-difluorobenzoate, 2,3-dihydroxybenzoate, 3,5-dihydroxybenzoate, 
3,5-dinitrobenzoate, 3-fluorobenzoate, 2-hydroxybenzoate, 3-hydroxybenzoate, 4- 
hydroxybenzoate, 3-methoxybenzoate, 3-nitrobenzoate, and 3-sulfobenzoate. 

10. The hybrid polyketide defined in claim 9 where the starter unit is selected 
from the group consisting of 2-aminobenzoate, 3-aminobenzoate, 4-aminobenzoate, 3- 

25 amino-4-hydroxybenzoate, 4-amino-2-hydroxybenzoate, 3-bromobenzoate, 3- 

chlorobenzoate, 3,5-diaminobenzoate, 3,5-dibromobenzoate, 3,5-dichlorobenzoate, 3,5- 
difluorobenzoate, 2,3-dihydroxybenzoate, 3,5-dinitrobenzoate, 3-fluorobenzoate, 2- 
hydroxybenzoate, 4-hydroxybenzoate, 3-methoxybenzoate, 3-nitrobenzoate, and 3- 
sulfobenzoate. 
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11. A method to produce a polyketide which method comprises culturing the 
cells of claim 5 under conditions wherein said polyketide is produced. 



12. The cells of claim 5 which are of the genus Escherichia, Streptomyces, 
Bacillus, PseudomonaSy or Flavobacterium. 

5 13. The cells of claim 1 2 which are E. coll 

14. The cells of claim 5 wherein said cells produce a complete polyketide 
derived from rifamycin, rapamycin, FK506, ansatrienin, FK520, microcystis pimaricin, 
erythromycin, oleandomycin, megalomycin, picromycin, spinosad, avermectin, tylosin or 
epothilone. 

10 15. The ceils of claim 1 4 which produce a modified rifamycin. 

16. The cells of claim 14 which produce a 6-dEB analog. 

1 7. The cells of claim 5, wherein said genetic modification further comprises 
incorporation of at least one expression system for a polyketide synthase protein. 

1 8. The cells of claim 5 wherein said genetic modification comprises 

1 5 incorporation of at least one expression system for a phosphopantetheinyl transferase. 

19. A method to enhance the production of at least one hybrid polyketide in a 
microbial host which method comprises providing said host with an expression system for 
producing a protein that incorporates an exogenous starter unit that primes an A-T 
didomain according to the method of claim 4. 

20 20. A method to enhance production of a secondary metabolite in a microbial 

host which method comprises inducing the production of said secondary metabolite after 
the microbial cells have reached a high cell density in the culture. 

21. The method of claim 20 wherein the secondary metabolite is a polyketide. 
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22. The method of claim 20 wherein said inducing is by supplying a compound 
which activates an inducible promoter. 

23. A method to enhance the production of a secondary metabolite in a 
microbial host which method comprises maintaining the nutrient level fed to said microbe 

5 at a relatively constant level during the production of said secondary metabolite. 

24. The method of claim 23 wherein said secondary metabolite is a polyketide. 

25. A method to enhance the production of a polyketide in a microbe which 
method comprises modifying said microbe to contain an expression system for a 
thioesterase II (TEII). 
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Figure 4 
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Figure 6 
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when the document is taken alone 

"Y* document of particular relevance; the claimed invention cannot be 
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BOX D. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING 

This application contains the following inventions or groups of inventions which are not so linked as to form a single general 
inventive concept under PCT Rule 13. 1 . In order for all inventions to be examined, the appropriate additional examination fees must 
be paid. 

Group I, claim(s) 1, drawn to microbial host cells for enhanced polyketide synthesis. 

Group II, claim(s) 2, drawn to methods of making polyke tides using microbial host cells for enhanced polyketide synthesis. 

Group III, claim(s) 3, drawn to methods to assess the results of a procedure effecting modification of polyketide synthase genes. 

Group IV, claim(s) 4, drawn to methods for determining whether a substituted benzoate can prime an adenylation-thiolation (A-T) 
didomain of a rifamycin synthetase. 

Group V, claim(s) 5-7 and 12-18, drawn to prokaryotic host cells modified to include an expressing system comprising an A-T 
didomain. 

Group VI, claim(s) 8-10, drawn to hybrid polyketides. 

Group VII, claim(s) 1 1 and 19, drawn to methods of making polyketides using prokaryotic host cells modified to include an 
expressing system comprising an A-T didomain. 

Group VIII, claim(s) 20-22, drawn to methods to enhance the production a secondary metabolite in a microbial host using high 
density cell culture. 

Group IX, claim(s) 23-24, drawn to methods to enhance the production a secondary metabolite in a microbial host using a constant 
nutrient level. 

Group X, claim(s) 25, drawn to methods to enhance the production a secondary metabolite in a microbial host using a thioesterase II 
expression system. 

The inventions listed as Groups I-X do not relate to a single general inventive concept under PCT Rule 13. 1 because, under PCT Rule 
13.2, they lack the same or corresponding special technical features for the following reasons: The technical feature of Claim 1 is the 
microbial cells that produce polyketides after having been modified to contain an expression system for a protein that produces an 
extender unit. This technical feature is not a special technical feature because the subject matter considered as a whole does not 
contribute to the prior art. Polyketide synthase gene clusters have been expressed in heterologous microbial hosts since at least the 
teachings of Kao et at. (Science (1994) 265:509-512). With this heterologous expression, the need to increase the supply of starter 
and extender units has been recognized since at least the teachings of Siassi et aL (PNAS (1998) 95:7305-7309). Thus, the subject 
matter of the Claim 1 does not contribute to the prior art and lacks a special technical feature. With that, each category of invention 
cannot share any special technical feature with Claim 1. The remaining claims are divided as to claim category (method, product, 
etc.) and subject mater. 
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Box I Observations where certain claims were found unsearchable (Continuation of Item 1 of first sheet) 



This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

I. Claim Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



I | Claim Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to 
such an extent that no meaningful international search can be carried out, specifically: 



3. \^\ Claim Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 
6.4(a). 



Box H Observations where unity of invention is lacking (Continuation of Stem 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Continuation Sheet 



1 . J j As all required additional search fees were timely paid by the applicant, this international search report covers all 
searchable claims. 

2- I f | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 
payment of any additional fee. 

3. [XI As only some of the required additional search fees were timely paid by the applicant, this international search 
report covers only those claims for which fees were paid, specifically claims Nos.: 1, 5-7, and 12-18 



4. 1 | No required additional search fees were timely paid by the applicant. Consequently, this international search report 
is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

Remark on Protest The additional search fees were accompanied by the applicant's protest. 

I 1 No protest accompanied the payment of additional search fees. 
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